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TRANSACTIVATION SYSTEM FOR MAMMALIAN CELLS 
I. Introduction 

A. Field of the Invention 

[0001] The present invention relates to the enhancement of recombinant protein expression in a 
mammalian host cell. More specifically, the invention relates to the enhancement of recombinant 
protein production by reducing apoptosis in a population of cells that contain a transcriptional 
activator (transactivator) protein that enhances gene expression of the recombinant protein. 

B. Background of the Invention 

[0002] Recombinant engineered antibodies and other recombinant human glycoproteins are 
typically produced by transfection into mammalian cells of genes. Available expression systems 
for antibodies and other complex proteins generally involve the use of vectors that integrate into 
the host cell genome. These vectors typically integrate at essentially random sites in the genome 
and the level of expression is profoundly influenced by the site of integration and the chromatin 
structure at the site. This fact is generally regarded as limiting the maximum expression that can 
be achieved from each copy of the vector and leads to a great deal of variability in the 
productivity of different transfectant clones. The most efficient mammalian expression systems 
make use of a selectable marlcer that can be subjected to progressively more stringent selection 
conditions in order to select for transfectants that express at the highest possible levels from the 
integrated vector sequences or have undergone gene amplification. Increasing the number of 
vector copies at the site of integration typically leads to concomitant increase in the productivity 
of the cell line. Two such amplifiable marlcers have been widely used: dihydrofolate reductase 
(DHFR; see for example US patents 5,179,017 and 6,455,275) and glutamine synthetase (GS; 
Bebbington et al (1992) Bio/Technology 10, 169-175; US patents 5,591,693; 5,827,739; 
5,770,359; 5,747,308; 5,122,464). 

[0003] Cell-hne development using such systems is time consuming and labor-intensive because 
of the need to screen a large number of transfectants to identify rare clones in which the vector 
has integrated into a favorable site in the genome for transcription. The available gene- 
amplification systems also suffer from the major drawback of being limited to use in a restricted 
number of cell-types. In practice, the DHFR system has been largely limited to use in DHFR- 
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minus mutant CHO cells; and GS selection has been most widely used in the NSO mouse 
myeloma line. 

[0004] Other cell types may have advantages in temis of growth at large scale in simple defined 
culture media but their use is limited by the absence of efficient gene expression systems for 
these cells. Furthermore, different mammalian cell types show differences in the pattems of 
glycosylation of the glycoproteins that they secrete. Altering the carbohydrate structure of 
antibodies, for example, can have profound effects on biological activity. Thus there is 
considerable interest in using cell types with particular pattems of glycosylation for generating 
antibodies and other recombinant proteins with different biological activities. 
[0005] Therefore, there is a need for a versatile expression system that could be used in different 
cell types and that permits more rapid cell-line development, hi addition, production of 
recombinant human proteins and antibodies at commercial scale is costly and there is a 
continuing need to develop more cost-efficient manufacturing processes. One general approach 
to achieving increased yields of protein firom fermenters is to generate increased cell biomass and 
to maintain cell viability for longer periods through modifications to the culture medium, feeding 
of nutrients and appropriate adjustments to oxygenation rates. An extension of this approach has 
been to engineer cells to better withstand environmental stress and so prolong viability and 
secretion of product. 

1. High-efficiency Expression Systems 

[0006] A general approach to enhancing productivity of recombinant cell lines is to enhance the 
rate of protein production from each cell. There would be considerable advantages in enhancing 
productivity from single-copy genes such that efficient production could be achieved without the 
need for selection for vector amplification. If expression systems could be developed that 
provide highly efficient single-copy gene expression, these would provide improved yields in 
cell lines for which amplifiable vectors are not effective, as well as potentially providing 
improved yields from cell lines compatible with vector amplification. Efficient single-copy gene 
expression also provides the prospect of more rapid and reliable, and less labor-intensive cell line 
selection. 

[0007] Accordingly, methods have been developed to target introduced vectors to particular sites 
in the genome of the host cell that are particularly favorable for gene expression. Such 
approaches have, in some instances, led to improved gene expression Examples of such an 
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approach are provided in U.S patents 5,648,267, 5,733,779, 6,017,733 and 6,159,730 pertaining 
to the NEOSPLA vector, and U.S. Pat. Nos. 5,830,698 and 5,998,144 pertaining to homologous 
recombination. 

[0008] An alternative approach to ensuring efficient expression from vectors that integrate into 
the host cell genome has been to provide DNA elements on the vector that establish a favorable 
chromatin structure and insulate the genes of interest from chromosomal position effects. A 
number of genetic elements have been identified that may mitigate position effects in integrating 
vectors. It is now generally accepted that eukaryotic genomes are organized into chromatin 
domains containing individual genes or gene clusters sharing co-ordinate regulation in different 
tissues or developmental stages. Several kinds of DNA elements marking the boundaries of these 
domains have been identified including Matrix attachment regions (MARs, also called scaffold 
attachment regions, SARs); locus control regions (LCRs), ubiquitous chromatin opening 
elements (UCOEs; WO 00/05393; WO 02/081677) and insulators (including sub-elements 
designated 'Carriers" Recillas-Targa et al (2002) Proc Natl Acad Sci USA. 99:6883-8.). Of 
these, UCOEs and barriers are attractive elements that can function in a variety of cell types to 
isolate vector elements from position effects. UCOEs have been used for the production of 
recombinant antibodies in suspension-adapted CHO cells suitable for manufacturing 
(WO 02/099089; WO 02/099070). However, the yields currently attainable with these expression 
systems are not typically as high as can be attained using DHFR or GS-mediated vector 
amplification. 

[0009] The use of transcriptional activator proteins ("transactivators") in combination with 
highly efficient promoters has also been explored. The Adenovirus El a transactivator has been 
used in transfected CHO cells to increase expression levels directed from the human 
cytomegalovirus (hCMV) major immediate early (MIE) promoter-enhancer. This expression 
system has been used for expression of antibodies and other proteins (Cockett et al 1991 Nucl. 
Acids Res. 19:319-25; US patent 5,866,359; US patent 6,653,101; Bender et al 1993 Hum 
Antibodies Hybridomas. 4:74-9, each of these references is incorporated herein by reference). 
Subsequently, others have shown that Adenovirus El gene products can function to provide 
efficient expression in a human fetal retina-derived line PER.C6 although the levels of antibody 
reported do not achieve the highest levels attained by vector amplification in rodent cells (Jones 
et al (2003) Biotechnol Prog. 19: 163-8). El a also acts as a transactivator in the human embryonic 
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kidney cell line HEK293, which is widely used at laboratory scale for transient transfection and 
has been used in the development of defective adenoviral vectors. However, this cell line does 
not seem suitable for large-scale growth in suspension culture. 

[0010] The Ela gene of Adenoviruses encodes two major proteins as a result of differential 
splicing. The proteins are characterized by their sedimentation coefficients (13S and 12 S) or 
their molecular size (289 and 243 amino acids, respectively, in Ad5) and are identical except for 
the presence of an additional "unique" region toward the C terminus of the 289 amino-acid 
protein (in one of three regions of the molecule, which are highly conserved between different 
strains of Adenovirus, designated conserved region 3). This region is required for transactivation 
and so only the 289 amino-acid form of Ela is a potent transactivator. The other conserved 
regions, CRl and CR2 have been implicated in other functions of Ela in modulation of the cell 
cycle. 

[0011] Ela proteins lack a sequence-specific DNA-binding activity, but modulate cellular gene 
expression by interacting with a diverse array of cellular factors, among them sequence-specific 
transcription factors, proteins of the general transcription machinery, co-activators and 
chromatin-modifying enzymes such as the histone acetylases, p300 and CBP (Ogryzko (1996) 
Cell 87:953-9). 

[0012] Ela contains a minimum of four independent transactivation functions (reviewed by 
Frisch and Mymryk 2002 Nat Rev Mol Cell Biol 3, 441-52). The unique region and regions in 
exons 1 and 2 can each independently induce gene expression. Of the trans-activation pathways 
through which Ela acts, that involving CR3 is the most potent: most of the activation of viral 
early genes and of exogenous cellular genes by Ela is brought about by this region. CR3 alone is 
sufficient to activate transcription, as microinjection of a synthetic 49 residue peptide 
representing CR3 into HeLa cells activates expression (Lillie et al (1987) Cell 50:1091-1100). 
CR3 consists of two domains At its C-terminal end, residues 183-188 interact with a promoter- 
bound transcription factors, in this way recruiting Ela to a promoter. Once recruited, the 
activation domain in the N-terminal part of CR3, residues 141-178, containing a zinc finger, 
binds to the TATAA box binding protein, TBP, and activates transcription by stimulating the 
formation of a multi-protein transcription initiation complex with RNA polymerase IT. Evidence 
suggests that the region of CR3 that binds TBP binds another cellular factor as well (reviewed by 
Frisch and Mymryk 2002 Nat Rev Mol Cell Biol 3, 441-52). 
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[0013] The mechanism for activation of the hCMV-MIE promoter-enhancer by adenovirus El a 
has also been studied. In addition to activation of TATAA box associated transcription factors, 
the CR3 domain of the Ela protein is necessary for transactivation mediated by the 19 base-pair 
repeats in the CMV enhancer, which contain binding sites for the CREB/ATF family of 
transcription factors (Sanchez et al 2000 Am J Respir Cell Mol Biol. 23:670-7). Certain ATF 
family members interact directly with the CR3 domain of Ela-289 (Chatton et al 1993 Mol Cell 
Biol. 13:561-70). The adenovirus E la-associated cellular protein p300 also acts as a 
transcriptional cofactor that interacts with the repressor YYl and mediates the rehef of YYl 
transcriptional repression by Ela. CREB has been shown to be modulated by Ela in a p300 
binding-dependent manner (Lee et al 1996 J Biol Chem.271: 17666-74). This effect of Ela was 
correlated with a specific physical interaction between CREB and p300. Two separate domains 
within p300 were identified as capable of activating transcription. One of the domains interacted 
with the basal transcription factor TFIIB, indicating that p300 functions as a coactivator by 
making contacts with both CREB/ATF family members and the basal transcriptional machinery. 
[0014] Despite the extensive characterization of Ela function, Ela transactivation has not been 
successfully appUed to other cell types for the production of recombinant proteins. Its utility in 
mammalian cells has been limited because, in addition to its role in transcription, Ela is pro- 
apoptotic (White et al. 1991 J. Virol. 65: 2968-2978) and hence leads to early death of the host 
cell if expressed at high level. Apoptosis seems to be p53-dependent in human and rodent cells. 
Expression of adenovirus Ela protein deregulates normal cellular constraints on cell-cycle 
regulation by interacting with RB, with its relatives pi 07 and pi 30, and with the transcriptional 
co-activators p300 and CBP (Cuconati and White (2002 Genes and Development 16: 2465- 
2478). These activities activate apoptosis mediated by p53 (Debbas and White 1993 Genes Dev. 
7: 546-554) a pathway that includes processing and activation of initiator procaspase-8, 
redistribution of cytochrome c, and activation of procaspase-3. Apoptosis is thought to be 
triggered primarily by binding of Ela to the retinoblastoma protein (RB). In the Gq/Gi phase of 
the cell cycle, hypophosphorylated pRB complexes with transcription factors of the E2F family 
preventing their ability to activate transcription. Cell cycle dependent phosphorylation of pRB by 
cyc\in/cdk complexes releases E2F to activate transcription of target genes required for S-phase 
of the cell cycle. Expression of Ela overrides the normal cellular control of the pRR-E2F 
interaction by binding hypophosphorylated pRB and releasing E2F. Among other effects the 
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aberrant induction of E2F, leads to elevation of p53 levels and activation of the p53-dependent 
pro-apoptotic activity. 

2. Apoptosis Protective Proteins 

[0015] Apoptosis (programmed cell death) is a conserved cellular process that is required for 
normal development and is observed, in some instances, in cultured cell lines subject to 
environmental stresses such as serum-starvation, nutrient depletion or accumulation of toxic 
metabolites. However, cells in culture also die by an altemative, and less controlled process, 
necrosis. In the induction of apoptosis, numerous stimuli and signaling pathways converge to 
bring about the demise of the cell via activation of cysteine aspartic proteases (caspases), the key 
effectors of cell death. In one pathway of caspase activation, procaspase-9 is activated in a 
cytoplasmic protein complex known as the apoptosome by association with cytochrome c and 
other pro-apoptotic effector proteins released into the cytosol from the mitochondria. 
Mitochondrial release is, in tum, triggered by permeabilization of the mitochondrion by BAX or 
BAK proteins positively regulated by BID and BAD proteins at the surface of the 
mitochondrion. The activation of BID and BAD can be induced by the key regulator of the cell- 
cycle, p53. p53 responds to numerous signals relating to the transcriptional status of the cell by 
triggering apoptosis, cell-cycle arrest or cell proliferation. 

[0016] In addition to activators of apoptosis, there are also multiple proteins within the cell that 
serve as checkpoints to restrict or prevent inappropriate apoptosis. In recent years the inhibitor of 
apoptosis (lAP) family of proteins have emerged as key inhibitors of the caspase cascade and 
thus represent central regulatory factors in apoptotic signaling. The lAPs, XIAP, cIAP-1, cIAP-2 
and Survivin, can prevent caspase activation by blocking cytochrome c-induced activation of 
procaspase-9. Bcl-2 and Bcl-X(L) represent two additional apoptosis regulators that act upstream 
of the caspases to prevent BAX and BAK activation and hence limit apoptosis induction. 
[0017] Anti-apoptotic genes have been introduced into cell lines expressing recombinant 
products in attempts to achieve prolonged maintenance of cell viability and hence increased 
yields of product. lAPs have been introduced into CHO cells and 293 cells and delayed cell 
death caused by depleted culture medium (Sauerwald TM, Betenbaugh MJ, Oyler GA (2002) 
Biotechnol Bioeng. 77:704-16; Sauerwald TM, Oyler GA, Betenbaugh MJ (2003) Biotechnol 
Bioeng. 81:329-40). Apoptosis-resistant CHO DUKX-Bll cells have been prepared by 
transfection of Bcl-2 (Lee and Lee (2003) Biotechnol. Bioeng. 82: 872-6). Bcl-2 has also been 
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introduced into NSO cells expressing a chimeric antibody that led to a modest (40%) increase in 
antibody titer (Tey et al 2000 J. Biotechnol. 79: 147-159). The activity of Bcl-2 is believed to be 
modulated by cleavage of a regulatory domain, the imstructured loop domain, present in both 
Bcl-2 and Bcl-X(L) (residues 32-80 in human Bcl-X(L); Chang et al (1997) EMBO J. 16:968- 
77). Deletion of this loop domain has been reported to lead to enhanced anti-apoptotic activity 
(Chang et al (1997) EMBO J. 16:968-77; Figueroa et al (2001) Biotechnol. Bioeng. 5: 21 1-222). 
[0018] Adenoviruses generate a viral homologue of Bcl-2, the Elb-19K protein, which has a 
similar anti-apoptotic activity. Elb-19K has been transfected into recombinant NSO cells and 
shown to increase productivity of an antibody under perfusion conditions (Mercille and Massie 
(1999) Biotechnol. Bioeng. 5: 529-543). However, no increase in product yield was seen as a 
result of Elb-19K expression if the same cell line was grown in a batch fermentation (Mercille, 
et al., Biotechnol Bioeng. 63:516-528 (1999)). E1B-19K has also been evaluated alone and in 
combination with Aven in DHFR-minus CHO cells for delaying apoptosis in a host cell 
expressing the recombinant antibody n)EC-131 in a high vector copy-number, DHFR gene- 
amplification system (WO 03/006607). In this case a 20% increase in product yield was achieved 
using Aven and E1B-19K together. Such attempts to regulate cell death have had limited 
success, possibly in part because cells in culture can also respond to environmental stress by 
necrosis. 

[0019] The modest increases in productivity demonstrated by delaying apoptosis in batch 
fermentations of mammalian cells are distinct firom the teachings of the present invention, which 
use anti-apoptotic proteins as protective proteins to permit the efficient expression of a 
transactivator in the same cell. 

3. Adenoviral Protection Against Apoptosis 

[0020] In cell lines transformed with large segments of adenoviral DNA, such as the human 
HEK 293 cell line, expression of other adenoviral gene products compensates for the pro- 
apoptotic effects of El a to permit cell survival by interfering with p53-dependent apoptotic 
pathways. The Elb region encodes two proteins, E1B-19K and E1B-55K that interfere with 
different elements of the apoptosis pathways in cells. E1B-55K acts in concert with another 
adenoviral protein, E4-Orf6, to inactivate p53, while E1B-19K is a homologue of the cellular 
Bcl-2 protein and acts in a similar manner to Bcl-2 to regulate downstream mediators of 
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apoptosis, particularly BAX and BAK. The E4 and Elb genes are both regulated by El a and 
cellular transcription factors in human cells such that, in HEK293 cells, they are produced in 
sufficient quantity to inhibit apoptosis. In further dissection of these apoptosis pathway in human 
cells, co-expression of E1B19K, Bcl-2 or RB has been shown to significantly inhibit Ela- 
mediated cell death (Putzer BM, Stiewe T, Parssanedjad K, Rega S, Esche H. (2000) Cell Death 
and Differentiation 7, 177-188; Cuconati and White (2002 Genes and Development 16: 2465- 
2478.) 

[0021] In primary cultures of rodent cells, Ela and Elb co-operate to permit oncogenic 
transformation. In these cells, Ela in the absence of Elb induces apoptosis in a p53-dependent 
manner. Indeed elevation of p53 expression is sufficient to induce apoptosis (Hale and 
Braithwaite (1999) J. Biol. Chem. 274: 23777-23786) and fimctional p53 is required for Ela- 
induced apoptosis primary rat kidney cells (Lowe and Ruley (1993) Genes Dev. 7: 535-545; 
Debbas and White (1993) Genes Dev. 7: 546-554). 

[0022] The effects of adenoviral proteins on cellular signaling pathways in permanent 
estabUshed rodent cell lines, such as those used for production of recombinant proteins, have not 
been extensively studied. Nevertheless, there are several indications of fimdamental differences 
between the control of cellular proliferation and apoptosis in primary cells and established cell 
lines. Thus primary cells respond to adjacent cell contact by growth arrest (a phenomenon known 
as "contact inhibition" of cell proliferation). In contrast, established cell Unes are typically not 
sensitive to contact inhibition. Elb has been shown to influence apoptotic pathways in only some 
cell lines (such as NSO) and not, for example, in hybridomas derived firom NSO (Mercille et al 
(1999) Biotechnol Bioeng.63:5 16-528), indicating that pathways of cell growth-regulation differ 
in different cell lines. 

[0023] The p53 protein is a critical regulator of the cell cycle in normal cells and is fi-equently 
mutated and hence defective in oncogenically transformed cells and cultured cell lines. As 
described above, p53 is also a key mediator of Ela-induced apoptosis in primary rodent cells and 
in productive adenoviral infections in human cells. However, the fact that p53 may be defective 
in many established cell lines suggests that responses to Ela will be different in such cells. Thus, 
for example the Chinese hamster ovary cell line CHO-Kl has a mutation in codon 21 1 of the p53 
gene leading to a protein with mutant fimction, abnormally high-level expression and lacking 
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normal cell-cycle control function at the Gi checkpoint ( Hu et al (1999) Mutation Res. 426: 51- 
62). 

[0024] El a has also been transfected into CHO cells for enhancing recombinant gene expression. 
Substantial cell killing due to El a expression was noted in this case, although the pathways 
leading to apoptosis have not been fully characterized. Thus careful titration of the level of El a 
expression is required in CHO cells in order to select cell lines that have sufficient El a for 
transactivation but low enough levels not to interfere unduly with cell survival and proliferation. 
Methods for accomplishing this are described in U.S. Patent Nos. 5,866,359 and 6,653,101. The 
present invention provides improved methods for transactivation that allow efficient expression 
of a recombinant protein under control of a transactivator in mammalian cell lines and provides 
protection from transactivator-mediated apoptosis, regardless of the level of El a expression. 

C. Summary of the Invention 

[0025] It is an object of the invention to provide methods to enhance transcription of a 
recombinant gene within a mammahan host cell using a co-expressed transcriptional activator. It 
is a further object of the invention to minimize or prevent adverse effects of the transactivator on 
host cell survival and proliferation. 

[0026] One aspect of the invention provides vector systems for introducing nucleic acids into a 
mammalian host cell comprising (a) a first cistron encoding a transactivator under control of a 
first promoter, (b) a second cistron encoding an apoptosis-protective protein, and (c) a third 
cistron encoding at least one desired polypeptide under control of a second promoter, wherein 
said second promoter is responsive to the transactivator protein. The vector system can contain 
the first, second and third cistrons in separate vectors, in two vectors (one containing a single 
cistron and the second containing two), or in a single vector containing all three cistrons. In a 
preferred embodiment the first cistron encodes an adenoviral El a protein or a variant thereof, 
more preferably an El a protein from human Ad2, Ad5 or Adl2. In another preferred 
embodiment, the transactivator is CREB and the second promoter comprises a CREB-binding 
element. Desired polypeptides include, but are not limited to, cytokines, growth hormones, 
antibodies or antibody fragments, single-chain antibodies, and fusion proteins. The vector 
system, optionally, includes a fourth cistron encoding at least a second desired polypeptide under 
control of a third promoter that is responsive to the transactivator protein, Altematively, the 
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vector system, optionally, comprises a fourth cistron under control of the second promoter. 
Apoptosis protective proteins include, for example, the ElB 19K protein, Bcl-2, Bcl-X(L), 
retinoblastoma (RB) protein, and a p53 mutant protein that acts as a dominant negative mutant 
and abrogates the activity of wild-type p53. 

[0027] Another aspect of the invention provides methods of enhancing the yield of a desired 
protein in a mammalian host cell comprising introducing into the mammahan cell: (a) a first 
cistron encoding a transactivator under control of a first promoter, (b) a second cistron encoding 
an apoptosis-protective protein, and (c) a third cistron encoding a desired polypeptide under 
control of a second promoter, wherein said second promoter is responsive to the transactivator 
protein. Preferably, expression of the transactivator protein under control of the first promoter in 
the absence of the apoptosis-protective protein would cause significant cell death and expression 
of the apoptosis-protective protein prevents cell-killing due to expression of the transactivator. 
In a preferred embodiment, the mammahan host cell is selected firom the group consisting of a 
CHO cell, a mouse myeloma cell, a mouse hybridoma cell, a rat myeloma cell, and a rat 
hybridoma cell. The first, second and third cistrons can be introduced to the host cell on the 
same or separate vectors. Additionally, one or two cistrons can be introduced into the host cell 
line and the other cistron(s) introduced subsequently. 

[0028] Yet another aspect of the invention provides mammalian host cells comprising a first 
cistron encoding a transactivator, a second cistron encoding an apoptosis-protective protein that 
prevents cell-killing due to expression of the transactivator, and a third cistron encoding one or 
more desired proteins imder the control of a promoter responsive to the transactivator. In a 
preferred embodiment, the transactivator is expressed firom an efficient heterologous promoter at 
a level that, in the absence of the protective protein, causes significant cell death. Preferably, the 
mammalian host cell is not a human cell. An aspect of the invention also provides methods for 
producing a recombinant protein comprising culturing the mammalian host cells in a suitable 
medium such that the desired protein(s) is secreted into the medium. 

[0029] A fiirther aspect of the invention provides manraialian host cells comprising a cistron 
encoding a variant El a protein and a cistron encoding a recombinant protein under the control of 
an activatable promoter, wherein the El a protein retains transactivation activity but is defective 
in the ability to trigger apoptosis. Preferably, the variant El a comprises a point mutation in the 
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N-terminal region (which includes the N-terminal portion of Ela, up to and including CRl) that 
inhibits binding to retinoblastoma protein (KB). 

[0030] Yet another aspect of the invention provides methods of expressing a desired polypeptide 
in a mammalian host cell comprising introducing into the mammahan host cell (a) a first cistron 
encoding a variant Ela protein under control of a first promoter, w^herein the Ela protein retains 
transactivation activity but is defective in the ability to trigger apoptosis and (b) a second cistron 
encoding a desired polypeptide imder control of a second promoter, wherein said second 
promoter is responsive to the transactivator protein. In a preferred embodiment, this method 
further comprises introducing into the host cell a third cistron encoding an apoptosis-protective 
protein. 

II. Brief Description of the Figures 

[0031] For a better imderstanding of the nature and objects of some embodiments of the 
invention, reference should be made to the following detailed description taken in conjunction 
with the accompanying drawings, in which: 

[0032] FIG. 1 provides the amino acid sequence of hamster Bcl-2 (SEQ ID NO: 1). 

[0033] FIG. 2 provides the amino acid sequence of hamster ABcl-2 (SEQ ID NO: 2). 

[0034] FIG. 3 provides the nucleotide sequence of the hamster ABcl-2 cDNA open reading 

frame (SEQ ID NO: 3). 

[0035] FIG. 4 provides the amino acid sequence of Ela-289R pm47/124 protein (SEQ ID NO: 

4) . 

[0036] FIG. 5 provides the nucleotide sequence of Ela pm47/124 coding region (SEQ ID NO: 

5) . 

[0037] FIG. 6 provides the nucleotide sequence of E1B-19K coding sequence spanning the 
EcoRl site to the Sail site (SEQ ID NO: 15). 

[00381 FIG. 7 (FIG. 7A-7D) provides the nucleotide (SEQ ID NO: 18) and deduced amino acid 
sequence (SEQ ID NO: 19) of human RB coding sequence cloned between Sail and Notl 
restriction sites. 

[00391 FIG. 8 (FIG. 8A-8B) provides the nucleotide (SEQ ID NO: 20) and deduced amino acid 
(SEQ ID NO: 21) sequence of variant human CREB-A coding sequence cloned between EcoRl 



11 



and Sail sites. The single point mutation at nucleotide 374 is underlined and the resulting Y134F 
amino acid substitution is shown in bold. 

III. Detailed Description 

A. Overview 

[0040] The present invention is directed generally to compositions and methods for enhancing 
expression of desired proteins by a mammalian host cell using a co-expressed transcriptional 
activator. In particular, the invention provides vectors, host cells and methods of expressing at 
least one desired polypeptide by introducing into a mammalian host cell cistrons encoding a 
transactivator, a desired polypeptide and an apoptosis-protective protein. The invention also 
provides vectors, host cells and methods of expressing desired polypeptides by stably 
introducing into a manraialian host cell a cistron encoding a variant El a protein that retains 
transactivation activity but is defective in the ability to trigger apoptosis. 

B. Definitions 

[0041] The term "cistron" refers to a specific sequence of nucleotides that encodes one or more 
polypeptide(s). More than one cistron may be transcribed from a single promoter. 
[0042] As used herein, the term "variant," as it applies to a polynucleotide refers to a 
polynucleotide sequence that differs from the corresponding wild type at one or more 
nucleotides. Polypeptide "variant," as the term is used herein, is a polypeptide that typically 
differs from a wildtype polypeptide sequence in one or more substitutions, deletions, additions 
and/or insertions. For example, certain illustrative variants of the polypeptides of the invention 
include those in which one or more portions, such as an N-terminal leader sequence or flexible 
loop region, have been removed. Other illustrative variants include variants in which a small 
portion (e.g., 1-30 amino acids, preferably 5-15 amino acids) has been removed from the N- 
and/or C-terminal of the mature protein. Such polynucleotide or polypeptide variants may be 
naturally occurring or may be synthetically generated, for example, by modifying one or more of 
the above polypeptide sequences of the invention and evaluating their immunogenic activity as 
described herein and/or using any of a number of techniques well known in the art. 
[0043] The term "transactivator, " as used herein, refers to a polypeptide product that interacts, 
directly or indirectly, with a region associated with a promoter, thereby turning on (activating) 
transcription of a structural gene. 
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[0044] The term "defective in the abihty to trigger apoptosis" refers to a characteristic of a 
variant transactivator that when expressed from a vector under the control of a CMV promoter 
decreases transfection efficiency by no more than 90% as compared to transfection of a control 
vector containing no transactivator. Preferably, the variant transactivators of the invention will 
decrease transfection efficiency by no more than 50%. 

[0045] The term "apoptosis-protective protein," as used herein refers to a polypeptide product 
that, when expressed, decreases the rate of apoptosis in a population of cells, 
[0046] "Isolated," as used herein in reference to nucleotides, means that a polynucleotide is 
substantially away from other coding sequences, and that a DNA molecule does not contain large 
portions of unrelated coding DNA, such as large chromosomal fragments or other functional 
genes or polypeptide coding regions. Of course, this refers to the DNA molecule as originally 
isolated, and does not exclude genes or coding regions later added to the segment by the hand of 
man. An "isolated" polypeptide is one that is removed from its original enviroimient. For 
example, a naturally-occurring protein or polypeptide is isolated if it is separated from some or 
all of the coexisting materials in the natural system. 

[0047] As used herein, the terms "protein" and "polypeptide" are used in their conventional 
meaning, i.e., as a sequence of amino acids. The polypeptides are not limited to a specific length 
of the product; thus, peptides, oligopeptides, and proteins are included within the definition of 
polypeptide, and such terms may be used interchangeably herein unless specifically indicated 
otherwise. This term also does not refer to or exclude post-expression modifications of the 
polypeptide, for example, glycosylations, acetylations, phosphorylations and the like, as well as 
other modifications known in the art, both naturally occurring and non-naturally occurring. 

C. Expression Systems of the Invention 

[0048] The invention provides improved methods for the activation of transcription from 
activatable cistrons by a transcriptional activator in a mammalian host cell that minimize or 
prevent adverse effects of the transactivator on cell growth and survival. 

[0049] The activatable cistron or cistrons are expressed from one or more promoters responsive 
to the transactivator. Examples include promoters or enhancers containing CREB-binding 
elements or the 19bp repeat from the hCMV-MIE enhancer and promoters containing the 
TATAA transcription initiation signal. Preferred enhancers are cytomegalovirus enhancers such 
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as the enhancer from the hCMV-MIE gene or the mouse CMV-MIE gene. A preferred promoter 
is the TATAA box region of the hCMV-MIE promoter. The activatable promoters are used to 
direct transcription of genes encoding recombinant protein products. 

[0050] Transactivators of the invention are polypeptides that enhance the expression of a desired 
gene by interacting directly or indirectly with nucleic acid sequences that are located in cis to the 
desired gene. The activator of the invention is a protein that serves to activate transcription of an 
activatable cistron when expressed in a host cell but that shows adverse effects on host cell 
growth and survival when expressed in its normal or immutated form at wildtype levels. In one 
embodiment, the transactivator of the invention is a protein that serves to activate transcription 
from an activatable control region in a host cell but that shows adverse effects on host cell 
growth and survival when expressed in its normal or unmutated form. Examples of such 
transactivators include El a proteins froni adenoviruses and mammalian proteins that are 
components of signal transduction pathways, including mammalian p53, c-myc and cyclic-AMP 
response-element binding proteins (CREBs). 

[0051] A preferred activator of the invention is an El a activator from an adenovirus such as a 
human, monkey or rodent adenovirus. Preferred examples include El a from human Ad2, Ad5 
and Adl2. The Ela activator contains an activation domain such as that present in the 289 amino 
acid Ad5 Ela protein. The Ela protein may also be a variant Ela protein and contain point 
mutations or deletions. In another embodiment, CREB proteins activate transcription of 
promoters containing DNA sequences known as cyclic-AMP response elements (CREs) and 
typically do so in response to cAMP-mediated signal transduction involving phosphorylation of 
CREB protein by cAMP-activated Protein Kinase A. A constitutively active mutant CREB has 
been characterized, which does not require cAMP-mediated signal-transduction to activate 
promoters containing CREs (Du et al (2000) Mol. Cell. Biol. 20: 4320-4327). However, the 
practical utility of CREB as an activator protein is limited since overexpression of CREB 
proteins has been shown to induce apoptosis in several cell lines (Saeki et al (1999) Biochem. J. 
343: 249-255). Similarly, p53 regulates the expression of nimierous genes and its 
overexpression induces apoptosis (Yonish-Rouach et al (1993) Mol. Cell. Biol. 13: 1415-1423; 
Yonish-Rouach et al (1995) Oncogene 11: 2197-205). The present invention overcomes this 
constraint. 
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1. Apoptosis Protective Proteins 

[0052] One embodiment of the invention provides methods of enhancing transcription of a 
cistron within a mammaHan host cell comprising introducing into a mammalian host cell (a) a 
first cistron encoding a transactivator under control of a first promoter, (b) a second cistron 
encoding an apoptosis-protective protein, and (c) a third cistron encoding a desired polypeptide 
under control of a second promoter, wherein said second promoter is responsive to the 
transactivator protein. Preferably, expression of the transactivator protein under control of the 
first promoter in the absence of the apoptosis-protective protein would cause significant cell 
death and expression of the apoptosis-protective protein prevents cell-killing due to expression of 
the transactivator. 

[00531 The apoptosis-protective protein of the invention serves to prevent transactivator- 
mediated cell death. Examples of suitable proteins include a target to which El a binds and that 
initiates a pro-apoptotic signaling cascade such as the retinoblastoma (RB) protein. Alternatively 
the protective protein may be a downstream inhibitor of E la-stimulated cell killing such as an 
anti-apoptotic protein. An example of a suitable anti-apoptotic protein is a mutant p53 protein 
that acts as a dominant negative mutant and abrogates the activity of the wild-type p53. Other 
suitable anti-apoptotic proteins include proteins that interact with BAX or BAK and inhibit their 
activity, such as Bcl-2, Bcl-X(L) or an adenovirus ElB 19K protein. Altemative anti-apoptotic 
proteins include an inhibitor of apoptosome formation such as Aven, or a downstream apoptosis 
inhibitor such as LAP or survivin. Preferably the anti-apoptotic protein is a cellular protein firom a 
species related to that of the host cell, such as a hamster or rodent protein. A particularly 
preferred protective protein is a variant Bcl-2 protein that has a deletion in the regulatory loop 
domain. 

[0054] Most preferably, the protective protein is a variant hamster Bcl-2 deleted of the loop 
domain. FIG. 1 shows the sequence of a hamster Bcl-2 protein (Genbank AJ271720) and FIG. 2 
provides a novel deletion mutant, hamster ABcl-2 in which the loop domain is replaced by a 
sequence of 4 alanine residues. The nucleotide sequence of the hamster ABcl-2 is shown in 
Figure 3. 

[0055] The protective protein of this aspect of the invention is expressed in an expression vector 
allowing appropriate expression in the mammalian host cell. Preferably the protective protein is 
expressed fi-om an efficient heterologous promoter such as a SV40, RSV or CMV promoter. If 
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the protective protein is E1B-19K, it is preferred that the E1B-19K coding sequence is under the 
control of a non-adenoviral promoter. 

[0056] The combination of transactivator, protective protein and activatable cistron in the same 
cell leads to significant increases in productivity of the activatable cistron. Preferably the specific 
production rate (production rate per cell) is enhanced at least two-fold by the combination of the 
transactivator and the protective protein. More preferably, the specific production rate is 
enhanced at least five-fold. 

2. Variant Transactivator Proteins 

[0057] hi addition, the invention also provides methods of expressing a desired polypeptide in a 
mammalian host cell comprising introducing into a mammalian host cell: (a) a first cistron 
encoding a variant El a protein under control of a first promoter, wherein the variant El a protein 
retains transactivation activity but is defective in the ability to trigger apoptosis and (b) a second 
cistron encoding a desired polypeptide under control of a second promoter, wherein said second 
promoter is responsive to the transactivator protein. This aspect of the invention also includes 
mammahan host cells comprising a first cistron encoding a variant El a protein that is defective 
in apoptosis induction, and a second cistron encoding a desired polypeptide under the control of 
an activatable promoter, preferably wherein the host cell is not a human cell. 
[0058] The Ela protein of this aspect of the invention retains the transactivation activity of Ela- 
289 but is defective in binding to the retinoblastoma protein and thereby permits cell survival 
and proliferation. Sequences required for RB binding have been described (for example see 
Shisler et al 1996 J. Virol 70: 68-77). Two regions of the Ela protein interact with RB, a region 
at the N-terminus, encompassing sequences in CRl, and a distinct site in CR2, Suitable 
mutations that inactivate RB binding are deletions or point mutations in the RB-binding sites in 
the N-terminal/CRl region of Ela and in the CR2 region, e.g. deltaCR2 (Samuelson and Lowe 
(1997) Proc. Natl. Acad. Sci. 94: 12094-12099). In a particularly preferred embodiment of this 
aspect of the invention, the Ela transactivator is a Ela-289 protein that retains regions of the 
molecule required for p300 binding but that is defective in RB binding. Most preferably, the 
variant Ela has mutations in either one or both of the two regions that bind RB. An example is 
the deletion of residues 26-35 and 111-123 (Shisler et al 1996 J. Virol 70: 68-77;. Another 
example is the combination of two point mutations: changing tyrosine to histidine at amino acid 
47 (47H) and cysteine to glycine at residue 124 (124G) (pm 47/124; Samuelson and Lowe 1997 
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Proc. Natl. Acad. Sci. 94: 12094-12099). The sequence of Ad5 Ela pm47/124 is shown in Figure 
4 and the nucleotide sequence in Figure 5 (in which the mutated residues are underlined). The 
nucleotide sequence contains an intron that is differentially spliced to generate both the 243 and 
289 amino-acid forms of Ela (Ela-243R and Ela-289R). Only the 289 amino-acid fomi is a 
potent transactivator. 

[0059] Additionally, the Ela protein may be modified at its N-terminus to increase the protein's 
stability. For example deletion of arginine at amino acid 2 or deletion of the first 14 residues may 
lead to the production of a more stable protein (Slavicek et al (1988) EMBO J. 7:3171-80). 
[0060] In an altemative embodiment of this aspect of the invention, the variant Ela protein is a 
novel variant protein identified by screening or selecting a variant Ela defective in RB binding 
from a plurality of variant Ela proteins. A library of variant Ela proteins can be generated and 
screened in a microbial expression system for binding to RB. The library of variant Ela proteins 
is preferably a focused library in which sequence variation is restricted to particular regions of 
the Ela molecule. More preferably, mutations are restricted to the N-terminal (including CRl) 
and CR2 regions of Ela. For illustration, the library of variant Ela proteins may be expressed in 
E, coli as a fiision protein with a bacterial beta-lactamase and variants defective in RB binding 
selected using an in vitro selection system. For this purpose, RB protein or a fragment of RB 
capable of binding to wildtype Ela may be co-expressed with the Ela variant library. The RB 
fragment is co-expressed, for example, as a fusion protein with an inhibitor of beta-lactamase 
such as a BLIP protein and defective Ela variants are selected by their ability to prevent BLIP- 
fiision protein associating with beta-lactamase. In this way, E. coli cells containing a defective 
Ela are able to grow in the presence of the antibiotic ampicillin whereas cells containing Ela 
capable of binding RB are Icilled by this concentration of ampicillin. An example of a suitable 
BLIP protein is the BLIP from Streptomyces clavuligerus (Strynadka et al (1994) Nature 368: 
657-660). The RB protein can be fiised to either the carboxy- or amino-terminus of BLIP via a 
peptide linker such as a peptide of the sequence (Gly-Gly-Gly-Gly-Ser) also designated (Gly4- 
Ser) or multiples thereof An example of a beta-lactamase protein is a masked beta-lactamase 
described in WO/03/069312. The Ela protein is fiised to the N- or C-terminus of beta-lactamase 
by a Gly4-Ser Hnker peptide The masked beta-lactamase has reduced affinity for BLIP and binds 
BLIP detectably only in the presence of associated RB and Ela. 
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[0061 J A polypq)tide 'Variant," as the term is used herein, is a polypeptide that typically differs 
from its wildtype counterpart in one or more substitutions, deletions, additions and/or insertions. 
Such variants may be naturally occurring or may be synthetically generated, for example, by 
modifying one or more of the above polypeptide sequences of the invention and evaluating their 
immunogenic activity as described herein and/or using any of a number of techniques well 
known in the art. 

[0062] In many instances, a variant will contain conservative substitutions. A "conservative 
substitution" is one in which an amino acid is substituted for another amino acid that has similar 
properties, such that one skilled in the art of peptide chemistry would expect the secondary 
structure and hydropathic nature of the polypeptide to be substantially imchanged. Modifications 
may be made in the structure of the polynucleotides and polypeptides of the present invention 
and still obtain a functional molecule that encodes a variant or derivative polypeptide with 
desirable characteristics, e.g., with the ability to transactivate transcription. When it is desired to 
alter the amino acid sequence of a polypeptide to create an equivalent, or even an improved 
variant or portion of a polypeptide, one skilled in the art will typically change one or more of the 
codons of the encoding DNA sequence according to Table 1. 

[0063] For example, certain amino acids may be substituted for other amino acids in a protein 
structure without appreciable loss of interactive binding capacity. Since it is the interactive 
capacity and nature of a protein that defines that protein's biological functional activity, certain 
amino acid sequence substitutions can be made in a protein sequence, and, of course, its 
underlying DNA coding sequence, and nevertheless obtain a protein with like properties. It is 
thus contemplated that various changes may be made in the peptide sequences of the disclosed 
compositions, or corresponding DNA sequences that encode said peptides without appreciable 
loss of their biological utility or activity. 



TABLE 1 



Amino Acids 


3-letter 


1 -letter 


Codons 


Alanine 


Ala 


A 


GCA GCC GCG GCU 


Cysteine 


Cys 


C 


UGCUGU 


Aspartic acid 


Asp 


D 


GAG GAU 


Glutamic acid 


Glu 


E 


GAA GAG 


Phenylalanine 


Phe 


F 


uucuuu 


Glycine 


Gly 


G 


GGA GGC GGG GGU 


Histidine 


His 


H 


CAC CAU 


Isoleucine 


He 


I 


AUAAUCAUU 
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Lysine 


Lys 


K 


AAA AAG 


Leucine 


Leu 


L 


UUA UUG CUA cue CUG CUU 


Methionine 


Met 


M 


AUG 


Asparagine 


Asn 


N 


AACAAU 


Proline 


Pro 


P 


CCA CCC CCG ecu 


Glutamine 


Gin 


Q 


CAACAG 


Arginine 


Arg 


R 


AGA AGG CGA CGC CGG CGU 


Serine 


Ser 


S 


AGC AGU UCA UCC UCG UCU 


Threonine 


Thr 


T 


ACAACCACGACU 


Valine 


Val 


V 


GUAGUCGUGGUU 


Tryptophan 


Trp 


W 


UGG 


Tyrosine 


Tyr 


Y 


U AC UAU 



[0064] In making such changes, the hydropathic index of amino acids may be considered. The 
importance of the hydropathic amino acid index in conferring interactive biologic function on a 
protein is generally understood in the art (Kyte and Doolittle, 1982, incorporated herein by 
reference). It is accepted that the relative hydropathic character of the amino acid contributes to 
the secondary structure of the resultant protein, which in turn defines the interaction of the 
protein with other molecules, for example, enzymes, substrates, receptors, DNA, antibodies, 
antigens, and the like. Each amino acid has been assigned a hydropathic index on the basis of its 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982). These values are: 
isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); 
methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (- 
0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate 
(-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

[0065] It is known in the art that certain amino acids may be substituted by other amino acids 
having a similar hydropathic index or score and still result in a protein with similar biological 
activity, i.e. still obtain a biological functionally equivalent protein. In making such changes, the 
substitution of amino acids whose hydropathic indices are within ±2 is preferred, those within ±1 
are particularly preferred, and those within +0.5 are even more particularly preferred. It is also 
understood in the art that the substitution of like amino acids can be made effectively on the 
basis of hydrophilicity. U. S. Patent 4,554,101 (specifically incorporated herein by reference in 
its entirety), states that the greatest local average hydrophilicity of a protein, as governed by the 
hydrophilicity of its adjacent amino acids, correlates with a biological property of the protein. 
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[0066] As detailed in U. S. Patent 4,554,101, the following hydrophilicity values have been 
assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); glutamate 
(+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); 
proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); 
leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). It is 
understood that an amino acid can be substituted for another having a similar hydrophilicity 
value and still obtain a biologically equivalent protein or a protein with similar functional 
characteristics. In such changes, the substitution of amino acids whose hydrophihcity values are 
within ±2 is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even 
more particularly preferred. 

[0067] As outlined above, amino acid substitutions are generally therefore based on the relative 
similarity of the amino acid side-chain substituents, for example, their hydrophobicity, 
hydrophihcity, charge, size, and the like. Exemplary substitutions that take various of the 
foregoing characteristics into consideration are well known to those of skill in the art and 
include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine and 
asparagine; and vaUne, leucine and isoleucine. 

[0068] Amino acid substitutions may further be made on the basis of similarity in polarity, 
charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic nature of the residues. 
For example, negatively charged amino acids include aspartic acid and glutamic acid; positively 
charged amino acids include lysine and arginine; and amino acids with uncharged polar head 
groups having similar hydrophilicity values include leucine, isoleucine and valine; glycine and 
alanine; asparagine and glutamine; and serine, threonine, phenylalanine and tyrosine. Other 
groups of amino acids that may represent conservative changes include: (1) ala, pro, gly, glu, 
asp, gin, asn, ser, thr; (2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) 
phe, tyr, tip, his. A variant may also, or alternatively, contain nonconservative changes. In a 
preferred embodiment, variant polypeptides differ from a native sequence by substitution, 
deletion or addition of five amino acids or fewer. Variants may also (or alternatively) be 
modified by, for example, the deletion or addition of amino acids that have minimal influence on 
the immunogenicity, secondary structure and hydropathic nature of the polypeptide, 
[0069] Variant El a genes according to this aspect of the invention, originating from the library 
of variants expressed in E. coli or from another source, can be introduced into mammahan 
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expression vectors and expressed in mammalian host cells from a promoter that is active in the 
chosen host cell. Preferably the promoter used is an efficient heterologous promoter such as a 
RS V-LTR promoter or a cytomegalovirus promoter. 

[0070] Preferably, the host cell is a non-human host cell, more preferably a rodent cell such as a 
CHO cell (e.g. CHO-S, DG44, DUKX-Bll, CHO-Kl), a mouse myeloma or a hybridoma cell 
(e.g. NSO, SP2/0), or rat myeloma or hybridoma line (e.g. YB2/0). The host cell line comprises a 
cistron encoding the variant El a protein and a cistron for a recombinant protein under the control 
of an activatable promoter. The two cistrons can be introduced to the host cell on the same or 
separate vectors. The cistrons can be introduced to the host cell by any of the methods widely 
known in the art including electroporation, calcium-phosphate co-precipitation, cell fusion and 
cationic lipid mediated transfection. At least one selectable marker gene is included in order to 
select for stable integration of plasmid DNA into the host cell genome. The selectable marker 
can be on the same plasmid as the transactivator cistron or the activatable cistron or on a separate 
plasmid or DNA fragment. Additional elements may be included with the activatable gene to 
ensure appropriate high-level expression, including ubiquitous chromatin opening elements, 
insulators, barrier elements, introns, polyadenylation signals, 5 '-untranslated regions and signal 
peptides. 

[0071] Li this aspect of the invention, the expression of the variant El a protein leads to at least 
two-fold enhancement of expression of the activatable cistron in the host cell. Preferably the 
expression of the variant El a leads to five-fold enhancement of expression of the activatable 
cistron. The variant El a protein is efficiently expressed in the host cell and does not significantly 
inhibit the survival and proUferation of the host cell. 
D. Host Cells of the Invention 

[0072] An aspect of the invention provides a mammalian host cell comprising a first cistron 
encoding a transactivator, a second cistron encoding an apoptosis-protective protein that prevents 
cell-killing due to expression of the transactivator , and a third cistron encoding one or more 
desired proteins under the control of a promoter responsive to the transactivator. In a preferred 
embodiment, the transactivator is expressed from an efficient heterologous promoter at a level at 
which, in the absence of the protective protein, significant cell death would occur. Another 
aspect of the invention provides a mammalian host cell comprising a first cistron encoding a 
variant transactivator that retains transactivation activity but is defective in the ability to trigger 
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apoptosis, and a second cistrons encoding one or more desired polypeptides under the control of 
a promoter responsive to the variant transactivator. 

[0073] The cell line may be maintained adhering to plastic or another surface or may be 
maintained in suspension culture. Numerous tissue culture media are available for the growth of 
mammalian cell lines and the appropriate medium will be selected for the particular cell line and 
growth conditions. Additives and modifications to the basal medium may be made. For example 
one or more proteins such as insulin, transferrin or albxmien, or animal serum may be added if 
desired. Lipid supplements may be required for certain cell lines such as the myeloma NSO. 
Antibiotics and selective agents such as zeocin, G418, hygromycin, methotrexate or methionine 
sulphoximine may also be added to maintain sterility or to maintain selective pressure according 
to the vectors introduced into the host cell. In a preferred embodiment of this aspect of the 
invention, the host cell is a CHO cell Une maintained in serum-free suspension culture, hi a 
particularly preferred embodiment, the CHO line is the CHO-S cell line (Invitrogen) cultured in 
a protein-free culture medium. An example of a suitable medixmi is CD-CHO medium 
(Invitrogen). 

[0074] Any nxmiber of selection systems may be used to recover cell-lines containing the nucleic 
acids. These include, but are not limited to, the herpes simplex virus thymidine kinase (Wigler, 
M. et al. (1977) Cell 11:223-32) and adenine phosphoribosyltransferase (Lowy, 1. et al. (1990) 
Cell 22:817-23) genes that can be employed in tk' or aprt cells, respectively. Also, 
antimetabolite, antibiotic or herbicide resistance can be used as the basis for selection; for 
example, DHFR, which confers resistance to methotrexate (Wigler, M. et al. (1980) Proc. Natl. 
Acad Sci. 77:3567-70); glutamine synthetase (GS), which confers glutamine-independent growth 
and resistance to methionine sulphoximine (Bebbington et al. (1992) Biotechnology 10(2): 169- 
75; and Cockett et al. (1991) Nucleic Acids Res. 25;19(2):319-25); npt, which confers resistance 
to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et al (1981) J Mol Biol. 
150:1-14); and als or pat, which confer resistance to chlorsulfuron and phosphinotricin, 
respectively (Murry, supra). Additional selectable genes have been described, for example, trpB, 
which allows cells to utilize indole in place of tryptophan, or his^ which allows cells to utilize 
histinol in place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc, Natl Acad Sci 
85:8047-5). The use of visible markers has gained popularity with such markers as green 
fluorescent protein (GFP), anthocyanins, betaglucuronidase and its substrate GUS, and luciferase 
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and its substrate luciferin, being widely used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector 
system (Rhodes, et al. (1 995) Methods Mol. Biol. 55:121-13 1). 

[0075] Although the presence/absence of marker gene expression suggests that the cistron of 
interest is also present, the presence and expression of the desired polypeptide may need to be 
confirmed. For example, if the sequence encoding a desired polypeptide is inserted within a 
marker gene sequence, recombinant cells containing sequences can be identified by the absence 
of marker gene function. Alternatively, a marker gene can be placed in tandem with a 
polypeptide-encoding sequence under the control of a single promoter. Expression of the marker 
gene in response to induction or selection usually indicates expression of the tandem gene as 
well. 

[0076] Alternatively, host cells that contain and express a desired polynucleotide sequence may 
be identified by a variety of procedures known to those of skill in the art. These procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridizations and protein bioassay or 
immunoassay techniques, which include, for example, membrane, solution, or chip based 
technologies for the detection and/or quantification of nucleic acid or protein. 
[0077] A variety of protocols for detecting and measuring the expression of desired polypeptide 
products, using either polyclonal or monoclonal antibodies specific for the product are known in 
the art. Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay 
(RIA), and fluorescence activated cell sorting (FACS). A two-site, monoclonal-based 
inmiunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes on a given 
polypeptide may be preferred for some applications, but a competitive binding assay may also be 
employed. These and other assays are described, among other places, in Hampton, R. et al. 
(1990, Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, et 
al. (1983; J Exp. Med 158:1211-1216). 

[0078] A wide variety of labels and conjugation techniques are known by those skilled in the art 
and may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PGR probes for detecting sequences related to polynucleotides include oligo 
labeling, nick translation, end-labeling or PGR amplification using a labeled nucleotide. 
Altematively, the sequences, or any portions thereof may be cloned into a vector for the 
production of an mRNA probe. Such vectors are known in the art, are commercially available, 
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and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA 
polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted 
using a variety of commercially available kits. Suitable reporter molecules or labels, which may 
be used include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents 
as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 
[0079] A newly synthesized peptide may be substantially purified by preparative high 
performance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures and 
Molecular Principles, WH Freeman and Co., New York, N.Y.) or other comparable techniques 
available in the art. The composition of the synthetic peptides may be confirmed by amino acid 
analysis or sequencing (e.g., the Edman degradation procedure). 

E. Vectors of the Invention 

10080] The vector systems of the invention may comprise any vector capable of transferring 
DNA to a cell. Preferably, the vector is an integrating vector or an episomal vector. Preferred 
integrating vectors include recombinant retroviral vectors. A recombinant retroviral vector will 
include DNA of at least a portion of a retroviral genome which portion is capable of infecting the 
target cells. The term "infection" is used to mean the process by which a virus transfers genetic 
material to its host or target cell. Preferably, the retrovirus used in the construction of a vector of 
the invention is also rendered replication-defective to remove the effect of viral replication of the 
target cells. Jn such cases, the replication defective viral genome can be packaged by a helper 
virus in accordance with conventional techniques. Generally, any retrovirus meeting the above 
criteria of infectiousness and capability of functional gene transfer can be employed in the 
practice of the invention. 

10081] Suitable retroviral vectors include but are not limited to pLJ, pZip, pWe and pEM, well 
known to those of skill in the art. Suitable packaging virus hnes for replication-defective 
retroviruses include, for example, 4^rip, H'Cre, ^2 and 4^ Am. 

[0082] Other vectors useful in the present invention include adenovirus, adeno-associated virus, 
SV40 virus, vaccinia virus, HSV and pox virus vectors. A preferred vector is an adenovirus 
vector. Adenovirus vectors are well known to those skilled in the art and have been used to 
deliver genes to numerous cell types, including airway epithelium, skeletal muscle, liver, brain 
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and skin (Hitt, et al. (1997) Pharmacology 40: 137-206; and Anderson (1998) Nature 392 (6679 
Suppl): 25-30). 

[0083] A further preferred vector is the adeno-associated (AAV) vector. AAV vectors are well 
known to those skilled in the art and have been used to stably transduce human T-lymphocytes, 
fibroblasts, nasal polyp, skeletal muscle, brain, erythroid and hematopoietic stem cells for gene 
therapy apphcations (Philip et al., 1994, MoL Cell. Biol, 14:2411-2418; Russell et al, 1994, 
PNAS USA, 91:8915-8919; Flotte et al., 1993, PNAS USA, 90:10613-10617; Walsh et al., 1994, 
PNAS USA, 89:7257-7261; Miller et al., 1994, PNAS USA 91:10183-10187; Emerson, 1996, 
Blood, 87:3082-3088). International Patent AppUcation WO 91/18088 describes specific AAV 
based vectors. 

[0084] Preferred episomal vectors include transient non-replicating episomal vectors and self 
replicating episomal vectors with fiinctions derived firom viral origins of replication such as those 
firom EBV, human papovavirus (BK) and BPV Such integrating and episomal vectors are well 
known to those skilled in the art and are fiiUy described in the body of literature well known to 
those skilled in the art. In particular, suitable episomal vectors are described in W098/07876. 
[0085] Manmialian artificial chromosomes are also preferred vectors for use in the present 
invention. The use of mammaUan artificial chromosomes is discussed by Calos (1996, Trends 
Genet 12:463-466). 

[0086] Various vectors and methods for introducing them into host cells are available for 
obtaining appropriate expression of the transactivator and the protective cistron of the invention. 
For example the cistrons can each be expressed from a strong constitutive promoter such as the 
hCMV-MBE promoter in plasmids pCI-neo or pCDNA3, or firom a RSV-LTR promoter. The 
cistrons can be on the same or separate plasmids and appropriate selectable markers are used to 
select for clones in which the vectors have integrated into the genome. For example markers that 
confer resistance to hygromycin, neomycin (G418) or zeocin can be used. The cistron or cistrons 
used to express the recombinant protein may also be introduced on the same or a different 
plasmid and can be introduced using various selectable markers including resistance markers or 
amplifiable markers such as GS or DHFR. Various additional elements that are known in the art 
may be included with the transactivator cistron, the protective cistron or the activatable cistron to 
ensure appropriate levels of expression. For example splice sites, polyadenylation signals, 5' or 
3' untranslated regions may be added. For the activatable cistron, efficient expression is desired 
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and additional elements intended to enhance expression levels may be added, including UCOEs, 
insulators, barrier elements and signal peptides. The vectors for expression of the cistrons are 
introduced into the manmialian host cell by any of the available methods including calcium 
phosphate co-precipitation, electroporation or cationic lipid mediated transfection. 
[0087] Suitable vector systems for expression of recombinant proteins and/or polypeptides 
according to the present invention may include one or more of the following attributes: (a) ease 
of manipulation; (b) elements that make high-level expression site-of-integration independent; 
(c) elements that make expression resistant to silencing/repression thereby allowing for 
sustained, stable expression over long periods of time; and (d) elements that express at high- 
levels in different cell types and in different species. 

[0088] In order to express a desired protein and/or polypeptide, the nucleotide sequences 
encoding the polypeptide, or functional equivalents, may be inserted into appropriate expression 
vector, i.e., a vector that contains the necessary elements for the transcription and translation of 
the inserted coding sequence. Methods that are well known to those skilled in the art may be 
used to construct expression vectors containing sequences encoding a polypeptide of interest and 
appropriate transcriptional and translational control elements. These methods include in vitro 
recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. Such 
techniques are described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et al. (1989) 
Current Protocols in Molecular Biology, John Wiley & Sons, New York. N.Y. 
[0089] Depending on the vector system and host utilized, any number of suitable transcription 
and translation elements, including constitutive and inducible promoters, may be used. Li 
mammalian cell systems, promoters from mammalian genes or from mammalian viruses are 
generally preferred. If it is necessary to generate a cell-line that contains multiple copies of the 
sequence encoding a polypeptide, vectors containing GS or DHFR selectable markers or vectors 
based on SV40 or EBV may be advantageously used with an appropriate selectable marker. 
[0090] In mammahan host cells, a number of viral-based expression systems are generally 
available. For example, in cases where an adenovirus is used as an expression vector, sequences 
encoding a polypeptide of interest may be ligated into an adenovirus transcription/translation 
complex consisting of the late promoter and tripartite leader sequence. Insertion in a non- 
essential El or E3 region of the viral genome may be used to obtain a viable virus that is capable 
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of expressing the polypeptide in infected host cells (Logan and Shenk, (1984) Proc. Natl Acad 
Sci USA 81:3655-3659). In addition, transcription enhancers, such as the Rous sarcoma virus 
(RS V) enhancer, may be used to increase expression in mammalian host cells. 
[0091] Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding a polypeptide of interest. Such signals include the ATG initiation codon and 
adjacent sequences. In cases where sequences encoding the polypeptide, its initiation codon, and 
upstream sequences are inserted into the appropriate expression vector, no additional 
transcriptional or translational control signals may be needed. However, in cases where only 
coding sequence, or a portion thereof, is inserted, exogenous translational control signals 
including the ATG initiation codon should be provided. Furthermore, the initiation codon should 
be in the correct reading fi-ame to ensure translation of the entire insert. Exogenous translational 
elements and initiation codons may be of various origins, both natural and synthetic. 
[0092] As will be understood by those of skill in the art, expression vectors containing 
polynucleotides of the invention may be designed to contain signal sequences that direct 
secretion of the encoded polypeptide through a eukaryotic cell membrane. Other recombinant 
constructions may be used to join sequences encoding a polypeptide of interest to nucleotide 
sequence encoding a polypeptide domain that will faciHtate purification of soluble proteins. Such 
purification facilitating domains include, but are not limited to, metal chelating peptides such as 
histidine-tryptophan modules that allow purification on immobilized metals, protein A domains 
that allow purification on immobilized immunoglobulin, and the domain utilized in the FLAGS 
extension/affinity purification system (Immunex Corp., Seattle, Wash.). The inclusion of 
cleavable linker sequences such as those specific for Factor XA or enterokinase (Invitrogen) 
between the purification domain and the encoded polypeptide may be used to facilitate 
purification. 

[0093] One such expression vector provides for expression of a fusion protein containing a 
polypeptide of interest and a nucleic acid encoding six histidine residues preceding a thioredoxin 
or an enterokinase cleavage site. The histidine residues faciUtate purification on IMIAC 
(immobilized metal ion affinity chromatography) as described in Porath, et al. (1992, ProL Exp, 
Pwn/3:263-281) while the enterokinase cleavage site provides a means for purifying the desired 
polypeptide fi*om the fiision protein. A discussion of vectors that contain fiision proteins is 
provided in KroU, et al. (1993; DNA Cell Biol 12:441-453). 
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F. Introducing Nucleic Acid into Host Cells 

[0094] A vector of the invention may be delivered to a host cell non-specifically or specifically 
(i.e., to a designated subset of host cells) via a viral or non-viral means of delivery. Suitable 
protocols are readily known and/or available to those of skill in the art. Exemplary protocols that 
are suitable for achieving high-level, large-scale introduction of nucleic acids include 
electroporation, calcium phosphate-mediated transfection, cell fusion, and those recommended 
by Invitrogen/Gibco for transfection of the CHO-S host cell-line. Generally, positive selection of 
cells containing the nucleic acid may be achieved using agents such as, for example, 
hygromycin, G418, and puromycin. Following selection, the pool of resulting clones may, 
optionally, be further subcloned to identify individual clones with the desired levels of protein 
expression. 

[0095] Preferred delivery methods of viral origin include viral particle-producing packaging cell 
lines as transfection recipients for the vector of the present invention into which viral packaging 
signals have been engineered, such as those of adenovirus, herpes vimses and papovaviruses. 
Preferred non-viral based gene delivery means and methods may also be used in the invention 
and include direct naked nucleic acid injection, nucleic acid condensing peptides and non- 
peptides, cationic liposomes and encapsulation in liposomes. 

G. Nucleic Acids of the Invention 

[0096] The nucleic acid sequences of the invention are further directed to sequences that encode 
variants of the described nucleic acids. These amino acid sequence variants may be prepared by 
methods known in the art by introducing appropriate nucleotide changes into a native or variant 
polynucleotide. There are two variables in the construction of amino acid sequence variants: the 
location of the mutation and the nature of the mutation. The amino acid sequence variants of the 
nucleic acids are preferably constructed by mutating the polynucleotide to give an amino acid 
sequence that does not occur in nature. These amino acid alterations can be made at sites that 
differ in the nucleic acids from different species (variable positions) or in highly conserved 
regions (constant regions). Sites at such locations will typically be modified in series, e.g., by 
substituting first with conservative choices (e.g., hydrophobic amino acid to a different 
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hydrophobic amino acid) and then with more distant choices {e.g., hydrophobic amino acid to a 
charged amino acid), and then deletions or insertions may be made at the target site. 
[0097] Amino acid sequence deletions generally range from about 1 to 30 residues, preferably 
about 1 to 10 residues, and are typically contiguous. Amino acid insertions include amino- 
and/or carboxyl-terminal fusions ranging in length from one to one hundred or more residues, as 
well as intrasequence insertions of single or multiple amino acid residues. Intrasequence 
insertions may range generally from about 1 to 10 amino residues, preferably from 1 to 5 
residues. Examples of terminal insertions include the heterologous signal sequences necessary 
for secretion or for intracellular targeting in different host cells. 

[0098] In a preferred method, polynucleotides encoding the variant transactivator proteins are 
changed via site-directed mutagenesis. This method uses oligonucleotide sequences that encode 
the desired amino acid variant, as well as a sufficient adjacent nucleotide on both sides of the 
changed amino acid to form a stable duplex on either side of the site of being changed. In 
general, the techniques of site-directed mutagenesis are well known to those of skill in the art 
and this technique is exempUfied by publications such as, Edehnan et al., DNA 2:183 (1983). A 
versatile and efficient method for producing site-specific changes in a polynucleotide sequence 
was published by ZoUer and Smith, Nucleic Acids Res. 10:6487-6500 (1982). 
[0099] PCR may also be used to create variants of the transactivator proteins of the invention. 
When small amounts of template DNA are used as starting material, primer(s) that differs 
slightly in sequence from the corresponding region in the template DNA can generate the desired 
amino acid variant. PCR amplification resuhs in a population of product DNA fragments that 
differ from the polynucleotide template encoding the collagen at the position specified by the 
primer. The product DNA firagments replace the corresponding region in the plasmid and this 
gives the desired amino acid variant. 

[00100] A fiirther technique for generating amino acid variants is the cassette mutagenesis 
technique described in Wells et al.. Gene 34:315 (1985); and other mutagenesis techniques well 
known in the art, such as, for example, the techniques in Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY, 1989, and 
Current Protocols in Molecular Biology, Ausubel et al. 

[00101] Due to the inherent degeneracy of the genetic code, other DNA sequences that 
encode substantially the same or a fimctionally equivalent amino acid sequence may be used in 
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the practice of the invention for the cloning and expression of these novel nucleic acids. Such 
DNA sequences include those that are capable of hybridizing to the appropriate novel nucleic 
acid sequence under stringent conditions. 

[00102] Polynucleotides of the invention may comprise a native sequence (i.e. an 
endogenous sequence that encodes a protein and/or polypeptide or a portion thereof) or may 
comprise a sequence that encodes a variant or derivative. Typically, polynucleotide variants will 
contain one or more substitutions, additions, deletions and/or insertions, preferably such that the 
activity of the polypeptide encoded by the variant polynucleotide is not substantially diminished 
relative to a polypeptide encoded by the wildtype gene. For example, a variant Ela protein of the 
invention would maintain the ability to transactivate transcription. The term "variants" should 
also be understood to encompass homologous genes of xenogeneic origin. 
[00103] The polynucleotides of the present invention, or fragments thereof, regardless of 
the length of the coding sequence itself, may be combined with other DNA sequences, such as 
promoters, polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, 
other coding segments, and the like, such that their overall length may vary considerably. It is 
therefore contemplated that a nucleic acid fragment of almost any length may be employed, with 
the total length preferably being limited by the ease of preparation and use in the intended 
recombinant DNA protocol. For example, illustrative polynucleotide segments with total lengths 
of about 10,000, about 5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 
100, about 50 base pairs in length, and the like, (including all intermediate lengths) are 
contemplated to be useful in many implementations of this invention. 

[00104] Polynucleotides suitable for high-level, large-scale expression according to the 
present invention may be identified, prepared and/or manipulated using any of a variety of well 
established techniques- (see generally, Sambrook et al.. Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY, 1989, and other like 
references). For example, a polynucleotide may be identified by screening a micro array of 
cDNAs for tumor-associated expression. Such screens may be performed, for example, using the 
microarray technology of Affymetrix, Inc. (Santa Clara, CA) according to the manufacturer's 
instructions (and essentially as described by Schena et al, Proc, Natl Acad, Set USA 93:10614- 
10619, 1996 and Heller et al.. Proa Natl Acad ScL USA 94:2150-2155, 1997). Altematively, 



30 



polynucleotides may be amplified firom cDNA prepared firom cells expressing the proteins 
described herein, such as tumor cells. 

[00105] Many template dependent processes are available to amplify a target sequences of 
interest present in a sample. One of the best known ampUfication methods is the polymerase 
chain reaction (PGR), which is described in detail in U.S. Patent Nos. 4,683,195, 4,683,202 and 
4,800,159, each of which is incorporated herein by reference in its entirety. Briefly, in PGR, two 
primer sequences are prepared that are complementary to regions on opposite complementary 
strands of the target sequence. An excess of deoxynucleoside triphosphates is added to a reaction 
mixture along with a DNA polymerase (e.g., Taq polymerase). If the target sequence is present 
in a sample, the primers will bind to the target and the polymerase will cause the primers to be 
extended along the target sequence by adding on nucleotides. By raising and lowering the 
temperature of the reaction mixture, the extended primers will dissociate fi'om the target to form 
reaction products, excess primers will bind to the target and to the reaction product and the 
process is repeated. Preferably reverse transcription and PGR T* ampUfication procedure may be 
performed in order to quantify the amount of mRNA amplified. Polymerase chain reaction 
methodologies are well known in the art. 

[00106] Any of a mmiber of other template dependent processes, many of which are 
variations of the PGR amplification technique, are readily known and available in the art. 
Illustratively, some such methods include the ligase chain reaction, described, for example, in 
Eur. Pat. Appl. Publ. No. 320,308 and U.S. Patent No. 4,883,750; QP Replicase, described in 
PCX Ml. Pat. Appl. Publ. No. PCT/US87/00880; Strand Displacement Amplification (SDA) and 
Repair Chain Reaction (RCR). Still other amplification methods are described in Great Britain 
Pat. Appl. No. 2 202 328, and in PCX Intl. Pat. Appl. Publ. No. PGX/US89/01025. Other nucleic 
acid amplification procedures include transcription-based amplification systems (XAS) (PGX 
Intl. Pat. Appl. Publ. No. WO 88/10315), including nucleic acid sequence based amplification 
(NASBA) and 3SR. Eur. Pat. Appl. Publ. No. 329,822 describes a nucleic acid amplification 
process involving cyclically synthesizing single-stranded RNA ("ssRNA"), ssDNA, and double- 
stranded DNA (dsDNA). PCX Intl. Pat. Appl. Publ. No. WO 89/06700 describes a nucleic acid 
sequence amplification scheme based on the hybridization of a promoter/primer sequence to a 
target single-stranded DNA ("ssDNA") followed by transcription of many RNA copies of the 
sequence. Another such technique is known as "rapid amplification of cDNA ends" or RAGE. 
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This technique involves the use of an internal primer and an external primer, which hybridizes to 
a polyA region or vector sequence, to identify sequences that are 5' and 3* of a known sequence. 
Additional techniques include capture PCR (Lagerstrom et al, PGR Methods Applic. 1:1 1 1- 
19,1991) and walking PCR (Parker et al, Nucl. Acids. Res. 19:3055-60,1991). Other methods 
employing amplification may also be employed to obtain a full length cDNA sequence. Other 
ampUfication methods such as "one-sided PCR" (Ohara, 1989) are also well-known to those of 
skill in the art. 

[00107] An amplified portion of a polynucleotide of the present invention may be used to 
isolate a full length gene from a suitable library (e.g., a tumor cDNA library or a library of 
mutagenized El a DNAs) using well known techniques. Within such techniques, a library is 
screened using one or more polynucleotide probes or primers suitable for amplification. 
[00108] In other embodiments of the invention, polynucleotide sequences or fragments 
thereof which encode polypeptides of the invention, or fusion proteins or functional equivalents 
thereof, may be used in recombinant DNA molecules to direct expression of a polypeptide in 
appropriate host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences 
that encode substantially the same or a functionally equivalent amino acid sequence may be 
produced and these sequences may be used to clone and express a given polypeptide. 
[00109] As will be understood by those of skill in the art, it may be advantageous in some 
instances to produce polypeptide-encoding nucleotide sequences possessing non-naturally 
occurring codons. For example, codons preferred by a particular eukaryotic host can be selected 
to increase the rate of protein expression or to produce a recombinant RNA transcript having 
desirable properties, such as a half-life that is longer than that of a transcript generated from the 
naturally occurring sequence. 

[00110] Moreover, the polynucleotide sequences of the present invention can be 
engineered using methods generally known in the art in order to alter polypeptide encoding 
sequences for a variety of reasons, including but not limited to, alterations that modify the 
cloning, processing, and/or expression of the gene product. For example, DNA shuffling by 
random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides 
may be used to engineer the nucleotide sequences. In addition, site-directed mutagenesis may be 
used to insert new restriction sites, alter glycosylation patterns, change codon preference, 
produce splice variants, or introduce mutations, and so forth. 
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H. Recombinant Polypeptides 

[00111] One aspect of the invention provides a method for producing a recombinant 
protein comprising culturing a cell line of the invention in a suitable medium such that the 
recombinant protein is secreted into the medium. A polypeptide may be an entire protein, or a 
portion thereof. Particular polypeptides of interest in the context of this invention are therapeutic 
polypeptides such as receptors, enzymes, ligands, regulatory factors, hormones, antibodies or 
antibody fragments and structural proteins, or variants thereof Therapeutic polypeptides also 
include sequences encoding nuclear proteins, cytoplasmic proteins, mitochondrial proteins, 
secreted proteins, membrane-associated proteins, serum proteins, Aoral antigens, bacterial 
antigens, protozoal antigens and parasitic antigens. The product may be a single polypeptide 
such as a cytokine, a growth hormone or a single-chain antibody. Alternatively the product may 
be derived from more than one polypeptide such as an immunoglobulin, w^hich contains a heavy- 
chain and a light-chain polypeptide. In this case each polypeptide is expressed from a separate 
copy of the activatable promoter or both polypeptides may be translated from a single mRNA 
with separate open reading frames separated by an intemal ribosome entry site (IRES) element. 
[00112] Within other illustrative embodiments, a polypeptide may be a fiision polypeptide 
that comprises multiple polypeptides. A fusion partner can, for example, assist in expressing the 
protein (an expression enhancer) at higher yields than the native recombinant protein. Other 
fiision partners may be selected so as to increase the solubility of the polypeptide or to enable the 
polypeptide to be targeted to desired intracellular compartments. Still fiirther fiision partners 
include affinity tags, which faciUtate purification of the polypeptide. 

[00113] Fusion polypeptides may generally be prepared using standard techniques, 
including chemical conjugation. Preferably, a fiision polypeptide is expressed as a recombinant 
polypeptide, allowing the production of increased levels, relative to a non-fiised polypeptide, in 
an expression system. Briefly, DNA sequences encoding the polypeptide components may be 
assembled separately, and ligated into an appropriate expression vector. The 3' end of the DNA 
sequence encoding one polypeptide component is ligated, with or without a peptide linker, to the 
5' end of a DNA sequence encoding the second polypeptide component so that the reading 
frames of the sequences are in phase. This permits translation into a single fiision polypeptide 
that retains the biological activity of both component polypeptides. 



33 



[00114] A peptide linker sequence may be employed to separate the first and second 
polypeptide components by a distance sufficient to ensure that each polypeptide folds into its 
secondary and tertiary structures. Such a peptide linker sequence is incorporated into the fusion 
polypeptide using standard techniques well known in the art. Suitable peptide linker sequences 
may be chosen based on the following factors: (1) their ability to adopt a flexible extended 
conformation; (2) their inability to adopt a secondary structure that could interact with functional 
epitopes on the first and second polypeptides; and (3) the lack of hydrophobic or charged 
residues that might react with the polypeptide functional epitopes. Preferred peptide linker 
sequences contain Gly, Asn and Ser residues. Other near neutral amino acids, such as Thr and 
Ala may also be used in the linker sequence. Amino acid sequences that may be usefully 
employed as linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et 
al., Proc. Natl. Acad Sci. USA 83:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. Patent 
No. 4,751,180. The linker sequence may generally be firom 1 to about 50 amino acids in length. 
Linker sequences are not required when the first and second polypeptides have non-essential N- 
terminal amino acid regions that can be used to separate the functional domains and prevent 
steric interference. 

[00115] Exemplary desired polypeptides according to the present invention include 
binding agents, such as antibodies and antigen-binding fragments thereof, that exhibit 
immunological binding to a tumor polypeptide disclosed herein, or to a portion, variant or 
derivative thereof An antibody, or antigen-binding fi-agment thereof, is said to "specifically 
bind," "immunologically bind," and/or is "immunologically reactive" to an antigen if it reacts at 
a detectable level (within, for example, an ELISA assay) with the polypeptide, and does not react 
detectably with unrelated polypeptides under similar conditions. 

[00116] Natural antibodies have been known for many years. Natural antibodies comprise 
a generally Y-shaped molecule having an antigen-binding site towards the free end of each upper 
arm. The remainder of the structure, and particularly the stem of the Y, mediates the effector 
functions associated with antibodies. Specifically, antibody molecules are comprised of two 
heavy (H) and two fight (L) polypeptide chains, held together by disulfide bonds. Each chain of 
an antibody chain is divided into regions or domains, each being approximately 110 amino acids. 
The light chain has two such domains while the heavy chain has four domains. The amino acid 
sequence of the amino-terminal domain of each polypeptide chain is highly variable (V region), 
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while the sequences of the remaining domains are conserved or constant (C regions). A Hght 
chain is therefore composed of one variable (Vl) and one constant domain (Cl) while a heavy 
chain contains one variable (Vh) and three constant domains (Chi, Cm and Chs). An arm of the 
Y-shaped molecule consists of a light chain (V+Cl) and the variable domain (Vh) and one 
constant domain (Chi) of a heavy chain. The tail of the Y is composed of the remaining heavy 
chain constant domains (Ch2 +Ch3). The C-terminal ends of the heavy chains associate to form 
the Fc portion. Within each variable region are three hypervariable regions. These hypervariable 
regions are also described as the complementarity determining regions (CDRs) because of their 
importance in binding of antigen. The four more conserved regions of the variable domains are 
described as the framework regions (FRs). Each domain of an antibody consists of two beta- 
sheets held together by a disulfide bridge, with their hydrophobic faces packed together. The 
individual beta strands are linked together by loops. The overall appearance can be described as a 
beta barrel having loops at the ends. The CDRs form the loops at one end of the beta barrel of the 
variable region. 

[00117] A number of therapeutically useful molecules are known in the art that comprise 
antigen-binding sites that are capable of exhibiting immunological binding properties of an 
antibody molecule. The proteolytic enzyme papain preferentially cleaves IgG molecules to yield 
several fragments, two of which (the "F(ab)" fragments) each comprise a covalent heterodimer 
that includes an intact antigen-binding site. The enzyme pepsin is able to cleave IgG molecules 
to provide several fragments, including the "F(ab*)2" fragment which comprises both antigen- 
binding sites. An "Fv" fragment can be produced by preferential proteolytic cleavage of an IgM, 
and on rare occasions IgG or IgA inmiunoglobulin molecule. Fv fragments are, however, more 
commonly derived using recombinant techniques known in the art. Any fragment of an antibody 
that is produced by enzymatic cleavage can also be made using recombinant techniques to 
express the desired fragment without requiring enzymatic processing. The Fv fragment includes 
a non-covalent VH::VL heterodimer including an antigen-binding site, which retains much of the 
antigen recognition and binding capabilities of the native antibody molecule. Inbar et al. (1972) 
Proc. Nat. Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) Biochem 15:2706-2710; and 
Ehrlich et al. (1980) Biochem 19:4091 A single chain Fv ("sFv") polypeptide is a covalently 
linked VH::VL heterodimer that is expressed from a gene fiision including VH- and VL- 
encoding genes linked by a peptide-encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. 
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USA 85(16):5879. A number of methods have been described to discem chemical structures for 
converting the naturally aggregated-but chemically separated-light and heavy polypeptide 
chains from an antibody V region into an sFv molecule that will fold into a three dimensional 
stmcture substantially similar to the structure of an antigen-binding site. See, e.g., U.S. Pat. Nos. 
5,091,513 and 5,132,405, to Huston et al.; and U.S. Pat, No. 4,946,778, to Ladner et al. 
[00118] By 1990, over 100 murine monoclonal antibodies were in clinical trials, 
particularly in the U.S. and especially for application in the treatment of CMicer. However, by 
this time it was recognized that rejection of murine monoclonal antibodies by the xmdesirable 
immxme response in humans termed the HAMA (Human Anti-Mouse Antibody) response was a 
severe limitation, especially for the treatment of chronic disease. Therefore, the use of rodent 
MAbs as therapeutic agents in humans is inherently limited by the fact that the human subject 
will mount an immunological response to the MAb and either remove the MAb entirely or at 
least reduce its effectiveness. 

[00119] Proposals have therefore been made for making non-human MAbs less antigenic 
in humans. Such techniques can be generically termed "himianization" techniques. These 
techniques generally involve the use of recombinant DNA technology to manipulate DNA 
sequences encoding the polypeptide chains of the antibody molecule. The use of recombinant 
DNA technology to clone antibody genes has provided an alternative whereby a murine 
monoclonal antibody can be converted to a predominantly human-form (i.e., humanized) with 
the same antigen binding properties. Generally, the goal of the humanizing technology is to 
develop humanized antibodies with very httle or virtually no murine component apart from the 
CDRs so as to reduce or eliminate their immunogenicity in humans. 

[00120] A number of "humanized" antibody molecules comprising an antigen binding site 
derived from a non-human immunoglobulin have been described. One method utilizes chimeric 
antibodies. "Chimeric" antibodies comprise a light chain and a heavy chain: the light chain is 
comprised of a light chain variable region and a light chain constant region; the heavy chain is 
comprised of a heavy chain variable region and a heavy chain constant region. Chimeric 
antibodies comprise variable regions from one species and constant regions from another species 
(for example, mouse variable regions joined to human constant regions). (See, e.g., U.S. Pat. 
Nos. 4,816,397 and 4,816,567, Winter et al. (1991) Nature 349:293-299; Lobuglio et al. (1989) 
Proc. Nat. Acad. Sci. USA 86:4220-4224; Shaw et al. (1987) J Immunol. 138:45344538; and 
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Brown et al. (1987) Cancer Res. 47:3577-3583). Other methods of humanizing antibodies 
include grafting rodent CDRs into a human supporting FR prior to fusion with an appropriate 
human antibody constant domain (Riechmann et al. (1988) Nature 332:323-327; Verhoeyen et al. 
(1988) Science 239:1534-1536; and Jones et al. (1986) Nature 321:522-525), and providing 
rodent CDRs supported by recombinantly veneered rodent FRs (European Patent Publication No. 
519.596, published Dec. 23, 1992). These "humanized'' molecules are designed to minimize 
xmwanted immunological response toward rodent antihuman antibody molecules, which limits 
the duration and effectiveness of therapeutic appUcations of those moieties in human recipients. 
[00121] Recombinant technology now allows the preparation of antibodies having the 
desired specificity from recombinant genes encoding a range of antibodies (Van Dijk et al, 
1989; incorporated herein by reference). Certain recombinant techniques involve the isolation of 
the antibody genes by immunological screening of combinatorial antibody phage expression 
libraries prepared from RNA isolated from the spleen of an immunized animal (Morrison et al., 
1 986; Winter and Milstein, 1 99 1 ; each incorporated herein by reference). 
[00122] For such methods, combinatorial antibody phagemid libraries are prepared from 
RNA isolated from the spleen of the immimized animal, and phagemids expressing appropriate 
antibodies are selected by panning using cells expressing the antigen and control cells. The 
advantages of this approach over conventional hybridoma techniques are that approximately 104 
times as many antibodies can be produced and screened in a single round, and that new 
specificities are generated by H and L chain combination, which further increases the percentage 
of appropriate antibodies generated. 

[00123] One method for the generation of a large repertoire of diverse antibody molecules 
in bacteria utilizes the bacteriophage lambda as the vector (Huse et al., 1989; incorporated herein 
by reference). Production of antibodies using the lambda vector involves the cloning of heavy 
and light chain populations of DNA sequences into separate starting vectors. The vectors are 
subsequently combined randomly to form a single vector that directs the co-expression of heavy 
and light chains to form antibody fragments. The heavy and light chain DNA sequences are 
obtained by ampUfication, preferably by PCR or a related amplification technique, of mRNA 
isolated from spleen cells (or hybridomas thereof) from an animal that has been immunized with 
a selected antigen. The heavy and light chain sequences are typically amplified using primers 
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that incorporate restriction sites into the ends of the amplified DNA segment to facilitate cloning 
of the heavy and light chain segments into the starting vectors. 

[00124] Another method for the generation and screening of large libraries of wholly or 
partially synthetic antibody combining sites, or paratopes, utiUzes display vectors derived fi-om 
filamentous phage such as Ml 3, fl or fd. These filamentous phage display vectors, referred to as 
"phagemids," yield large libraries of monoclonal antibodies having diverse and novel 
immimospecificities. The technology uses a filamentous phage coat protein membrane anchor 
domain as a means for linking gene-product and gene during the assembly stage of filamentous 
phage replication, and has been used for the cloning and expression of antibodies fi-om 
combinatorial libraries (Kang et al., 1991; Barbas et al, 1991; each incorporated herein by 
reference). 

[00125] This general technique for filamentous phage display is described in U.S. Pat, No. 
5,658,727, incorporated herein by reference. In a most general sense, the method provides a 
system for the simultaneous cloning and screening of pre-selected Ugand-binding specificities 
fi-om antibody gene repertoires using a single vector system. Screening of isolated members of 
the library for a pre-selected ligand-binding capacity allows the correlation of the binding 
capacity of an expressed antibody molecule with a convenient means to isolate the gene that 
encodes the member from the library. 

[00126] Linkage of expression and screening is accomplished by the combination of 
targeting of a fiision polypeptide into the periplasm of a bacterial cell to allow assembly of a 
fimctional antibody, and the targeting of a fiision polypeptide onto the coat of a filamentous 
phage particle during phage assembly to allow for convenient screening of the library member of 
interest. Periplasmic targeting is provided by the presence of a secretion signal domain in a 
fiision polypeptide. Targeting to a phage particle is provided by the presence of a filamentous 
phage coat protein membrane anchor domain (i.e., a cpIII- or cpVIII-derived membrane anchor 
domain) in a fiision polypeptide. 

[00127] The diversity of a filamentous phage-based combinatorial antibody library can be 
increased by shuffling of the heavy and light chain genes, by altering one or more of the 
complementarity determining regions of the cloned heavy chain genes of the library, or by 
introducing random mutations into the library by error-prone polymerase chain reactions. 
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Additional methods for screening phagemid libraries are described in U.S. Pat. Nos. 5,580,717; 
5,427,908; 5,403,484; and 5,223,409, each incorporated herein by reference. 
[00128] Another method for producing diverse libraries of antibodies and screening for 
desirable binding specificities is described in U.S. Pat. Nos. 5,667,988 and 5,759,817, each 
incorporated herein by reference. The method involves the preparation of libraries of 
heterodimeric antibody molecules in the form of phagemid libraries using degenerate 
oligonucleotides and primer extension reactions to incorporate the degeneracies into the CDR 
regions of the antibody variable heavy and light chain variable domains, and display of the 
mutagenized polypeptides on the surface of the phagemid. Thereafter, the display protein is 
screened for the ability to bind to a preselected antigen. 

[00129] The method for producing a heterodimeric antibody molecule generally involves 
(1) introducing a heavy or hght chain V region-coding gene of interest into the phagemid display 
vector; (2) introducing a randomized binding site into the phagemid display protein vector by 
primer extension with an oligonucleotide containing regions of homology to a CDR of the 
antibody V region gene and containing regions of degeneracy for producing randomized coding 
sequences to form a large population of display vectors each capable of expressing different 
putative binding sites displayed on a phagemid surface display protein; (3) expressing the display 
protein and binding site on the surface of a filamentous phage particle; and (4) isolating 
(screening) the surface-expressed phage particle using affinity techniques such as panning of 
phage particles against a preselected antigen, thereby isolating one or more species of phagemid 
containing a display protein containing a binding site that binds a preselected antigen. 
[00130] A further variation of this method for producing diverse libraries of antibodies 
and screening for desirable binding specificities is described in U.S. Pat. No. 5,702,892, 
incorporated herein by reference. In this method, only heavy chain sequences are employed, the 
heavy chain sequences are randomized at all nucleotide positions that encode either the CDRI or 
CDRIII hypervariable region, and the genetic variability in the CDRs is generated independent of 
any biological process. 

[00131] In the method, two libraries are engineered to genetically shuffle oligonucleotide 
motifs within the fi-amework of the heavy chain gene structure. Through random mutation of 
either CDRI or CDRIII, the hypervariable regions of the heavy chain gene were reconstructed to 
result in a collection of highly diverse sequences. The heavy chain proteins encoded by the 



39 



collection of mutated gene sequences possessed the potential to have all of the binding 
characteristics of an antibody while requiring only one of the two antibody chains. 
[00132] Specifically, the method is practiced in the absence of the antibody light chain 
protein. A library of phage displaying modified heavy chain proteins is incubated with an 
immobilized ligand to select clones encoding recombinant proteins that specifically bind the 
immobilized ligand. The bound phage are then dissociated fi"om the immobilized ligand and 
amplified by growth in bacterial host cells. Individual viral plaques, each expressing a different 
recombinant protein, are expanded, and individual clones can then be assayed for binding 
activity. 

[00133] Alternatively, the polypeptide expressed using the present invention can be a 
fully-human antibody. To prepare an antibody with a desired specificity from a human patient, 
one would simply obtain human lymphocytes from an individual having the desired specific 
antibodies, for example firom human peripheral blood, spleen, lymph nodes, tonsils or the like, 
utilizing techniques that are well known to those of skill in the art. The use of peripheral blood 
lymphocytes will often be preferred. 

[00134] Human monoclonal antibodies may be obtained fi-om the human lymphocytes 
producing the desired antibodies by immortalizing the human lymphocytes, generally in the 
same manner as described above for generating any monoclonal antibody. The reactivities of the 
antibodies in the culture supematants are generally first checked, employing one or more 
selected antigen(s), and the lymphocytes that exhibit high reactivity are grown. The resulting 
lymphocytes are then fused with a parent line of human or mouse origin, and further selection 
gives the optimal clones. 

[00135] The recovery of monoclonal antibodies from the immortalized cells may be 
achieved by any method generally employed in the production of monoclonal antibodies. For 
instance, the desired monoclonal antibody may be obtained by cloning the immortalized 
lymphocyte by the limiting dilution method or the like, selecting the cell producing the desired 
antibody, growing the selected cells in a medium or the abdominal cavity of an animal, and 
recovering the desired monoclonal antibody from the culture supernatant or ascites. 
[00136] Such techniques have been used, for example, to isolate himian monoclonal 
antibodies to Pseudomonas aeruginosa epitopes (U.S. Pat. Nos. 5,196,337 and 5,252,480, each 
incorporated herein by reference); polyribosylribitol phosphate capsular polysaccharides (U.S. 
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Pat. No. 4,954,449, incorporated herein by reference); the Rh(D) antigen (U.S. Pat. No. 
5,665,356, incorporated herein by reference); and viruses, such as human immunodeficiency 
virus, respiratory syncytial virus, herpes simplex virus, varicella zoster virus and 
cytomegalovirus (U.S. Pat. Nos. 5,652,138; 5,762,905; and 4,950,595, each incorporated herein 
by reference). After isolation, the genes encoding the desired antibody can be identified and 
cloned using standard techniques. The desired genes can then be utilized in the present system to 
express high levels of recombinant human antibody. 

[0Q137] Recombinant technology can also be used to prepare antibodies from transgenic 
mice containing human antibody libraries. Such techniques are described in U.S. Pat. No. 
5,545,807, incorporated herein by reference. In a most general sense, these methods involve the 
production of a transgenic animal that has inserted into its germline genetic material that encodes 
for at least part of an antibody of human origin or that can rearrange to encode a repertoire of 
antibodies. The inserted genetic material may be produced from a himian source, or may be 
produced synthetically. The material may code for at least part of a known antibody or may be 
modified to code for at least part of an altered antibody. 

[00138] The inserted genetic material is expressed in the transgenic animal, resulting in 
production of an antibody derived at least in part from the inserted human antibody genetic 
material. It is found the genetic material is rearranged in the transgenic animal, so that a 
repertoire of antibodies with part or parts derived from inserted genetic material may be 
produced, even if the inserted genetic material is incorporated in the germline in the wrong 
position or with the wrong geometry. 

[00139] Once a suitable transgenic animal has been prepared, the animal is simply 
immunized with the desired immunogen. Depending on the nature of the inserted material, the 
animal may produce a chimeric antibody, e.g. of mixed mouse/human origin, where the genetic 
material of foreign origin encodes only part of the antibody; or the animal may produce an 
entirely foreign antibody, e.g. of wholly human origin, where the genetic material of foreign 
origin encodes an entire antibody. 

[00140] The foregoing transgenic animals are usually employed to produce human 
antibodies of a single isotype, more specifically an isotype that is essential for B cell maturation, 
such as IgM and possibly IgD. Another preferred method for producing human antibodies is 
described in U.S. Pat. Nos. 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016; and 
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5,770,429; each incorporated by reference, wherein transgenic animals are described that are 
capable of switching from an isotype needed for B cell development to other isotypes. The genes 
encoding the desired antibodies generated by the transgenic animal can be cloned utilizing 
standard techniques and produced using the expression system of the present invention. 
[00141] Additional modifications can be made to the sequences of antibodies to alter the 
effector function or to reduce the potential for immunogenicity or for another purpose. For 
example, point mutations may be introduced into the constant region of the antibody Heavy 
chain to modify effector function as described in U.S. Pat. Nos. 5,624,821 and 5,648,260. 
Alterations may be made in the V-region of antibodies with the purpose of altering the binding 
specificity or affinity or to remove T-cell epitopes and hence reducing inmiunogenicity, a 
process known as "de-immunization' \ For this purpose, computer algorithms are used to predict 
peptide sequences within the antibody coding sequence which may be involved in binding to 
human MHC Class 11 molecules. These potential T-cell epitopes may be removed by appropriate 
alteration in the sequence. Such altered or designed antibody sequences may be constructed 
synthetically by construction and assembly of chemically synthesized oUgonucleotides or may be 
introduced into cloned DNA sequences by site-directed mutagenesis according to standard 
techniques known in the art. 

IV. Examples 

[00142] The following examples are provided as a guide for a practitioner of ordinary skill 
in the art. The examples should not be construed as limiting the invention, as the examples 
merely provide specific methodology useful in understanding and practicing an embodiment of 
the invention. 

A. Example 1: Cloning and expression of Ela and a variant Ela defective in RB 
binding 

[00143] The Ela gene fi'om Ad5 is cloned by PGR fi-om adenoviral DNA in the adherent 
HEK293 cell line (Stratagene) using the following ohgonucleotide primers and cloned between 
the EcoRl and Sail sites of pCI-neo (Promega) to construct pCI-Ela.neo. 

Primer 1 : CCCGAATTCGCCGCCACCATGAGACATATTATCTGCCAC (SEQ ID NO: 6) 

Primer 2: CCCGTCGACCTTATGGCCTGGGGCGTTT (SEQ ID NO: 7) 
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[00144] Alternatively, the gene is expressed from the RSV-LTR promoter in the plasmid 
pOPRSVI/MCS (Stratagene) between the Kpnl and Notl sites using the following amplification 
primers to construct an expression plasmid pRSV-Ela. 

Primer 3: CCCGGTACCGCCGCCACCATGAGACATATTATCTGCCAC (SEQ ID NO: 8) 

Primer 4: CCCGCGGCCGCCTTATGGCCTGGGGCGTTT (SEQ ID NO: 9) 

[00145] The tyrosine a amino acid 47 is mutated to histidine (47H) and the cysteine a 
amino acid 124 is mutated to glycine (124G) by PGR mutagenesis. To introduce the 124G 
mutation, the forward primer is: 

Primer 5: GGAGGTGATC GATCTTACCG GCCAC (SEQ ID NO: 10) 

[00146] The reverse primer is either Primer 2 or Primer 4 above for cloning into vectors 
containing Sail or Notl sites respectively. Primer 5 contains a Clal site that can be used for 
reconstruction of the 5' end of the gene (360bp Kpnl - Clal fragment from pRSV-Ela). For this 
purpose, the El a gene is cloned in a plasmid introduced into an E. coli strain that is defective in 
dam methylase. 

[00147] For construction of the 47H mutation in pRSV-Ela, mutagenesis is carried out 
using Primers 3 and 4 together with mutagenesis primers 6 and 7: 

Primer 6: CCTACCCTTCACGAACTGCATGATTTAGACGTGACG (SEQ ID NO: 1 1) 

Primer 7: CGTCACGTCTAAATCATGCAGTTCGTGAAGGGTAGG (SEQ ID NO: 12) 

[00148] PGR reactions are set up using titered amounts of template DNA. (e.g. 1 ng, 4 ng, 
8 ng and 16 ng) and ICQ ng each of forward and reverse primers. Reactions are run using Pfii 
polymerase (Stratagene) according to the manufacturer's instmctions. PGR fragments are 
digested with appropriate restriction enzymes and subcloned into plasmid vectors. All DNA 
clones are verified by DNA sequencing; the nucleotide coding sequence for mutant Ela (47/124) 
is given in Figure 5. 

[00149] The vectors for expressing Ela or a variant Ela are introduced into CHO-S cells 
by lipofectamine-mediated transfection (Invitrogen) according to the manufacturer's instructions. 
Stable transfectants are selected using G418 (0.5 mg/ml). Because wild-type Ela induces 
apoptosis in the host cell when expressed from the efficient hCMV or RSV-LTR promoters, the 
transfection efficiency for E la-expressing plasmids is greatly reduced compared to a pCI-neo 



43 



control plasmid. The variant El a defective in RB binding can be successfully transfected into the 
CHO-S cells at high ifrequency when expressed from strong promoters. 

[00150] For detection of El a expression, cells are fixed in phosphate-buffered saline 
containing 4% paraformaldehyde and stained with anti-El a antibody (M73; Santa Cruz 
Biotechnology) followed by FITC-conjugated secondary antibody. 

[00151] For analysis of transactivation, CHO-S cells are transfected with a reporter gene 
under the control of the hCMV-MIE promoter enhancer. The plasmid phrGFP-1 (Stratagene) 
expresses a modified green fluorescent protein (GFP) with reduced toxicity in mammalian cells 
under the control of the hCMV-MIE promoter. This plasmid is transiently transfected into CHO- 
S cells stably expressing El a genes and relative GFP levels are determined by flow cytometry 24 
- 48 hours post-transfection. For selection of stable GFP-expressing cell lines, a hygromycin 
selectable marker is introduced at the LoxP site in phrGFP-l using the EC-Hyg expression 
cassette (Stratagene) according to the manufacturer's instructions and CHO-S transfectants 
selected using hygromycin. 

[00152] Cell lines expressing Ela transactivate the hCMV promoter and lead to increased 
levels of expression of GFP both in transient and stable transfections compared with GFP levels 
in CHO-S cells lacking Ela activity. 

B. Example 2: Cloning and Expression of E1B-19K Protein 

[00153] E1B-19K coding sequence is cloned from Ad5 DNA by PCR amplification as 
described in Example 1 but using the following primers: 

Primer 8: CCCGAATTCGCCGCCACCATGGAGGCTT GGGAGTGTTT (SEQ ID NO: 13) 

Primer 9: CCCGTCGACCAACATTCATTCCCGAGGGT (SEQ ID NO: 14) 

100154] The PCR fragment is cloned between the EcoRl and Sail sites of pExchange-1 
(Stratagene) such that the gene is expressed from the hCMV promoter. The nucleotide sequence 
of the EcoRl - Sail DNA fragment is shown in Figiu-e 6. A selectable marker is inserted in the 
vector using LoxP mediated exchange (Stratagene) and the expression plasmid is used to 
transfect CHO-S cells as described in Example 1. 

[001551 For analysis of protein expression by immunofluorescence, cells are seeded on 
covershps, fixed in phosphate-buffered saline containing 4% paraformaldehyde, and the 
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preparations stored in phosphate-buffered sahne at 4°C. Cells are stained with rat monoclonal 
antibody to ElB-19-kDa (DP07L; EMD Biosciences) according to the manufacturer's 
instructions. The samples are then stained with fluorescein isothiocyanate (FITC)-conjugated 
secondary anti-rat antibodies and visualized with a fluorescence microscope. 
[00156] Cells expressing E1B-19K are transfected with wild-type or variant El a as 
described in Example 1. The E1B-19K expressing cells are resistant to apoptosis induction 
induced by El a. Transactivation by El a is determined as described in Example 1. 

C. Example 3: Cloning and Expression of variant hamster Bcl*2 

[00157] The hamster Bcl-2 cDNA is cloned by PCR amplification using standard 
techniques using amplification primers based on the known sequence of hamster Bcl-2 (Genbank 
AJ271720). PCR mutagenesis is used to construct the deletion variant Bcl-2 coding sequence 
shown in Figure 3. 

[00158] Standard recombinant polymerase chain reaction methodology is employed to 
insert oligonucleotides encoding the HA epitope, (M)AYPDYVPDYAV, at the 5 '-end of the 
protein-coding sequence of BCL-2 cDNA. The coding sequences are cloned into the vector pCI- 
neo (Promega) carrying a neomycin resistance gene or into a derivative vector carrying a 
hygromycin resistance gene. The authenticity of all constructs is verified by DNA sequencing. 
[00159] The vectors are transfected into CHO-S cells, using Lipofectamine (Invitrogen) 
according to the manufacturer's instructions and stable transfectants obtained by selection in 
G418 or hygromycin B and identified by Westem blot analysis and immunofluorescence. 
Western blots are carried out using antisera to HA Tag. 

[00160] Cells expressing recombinant Bcl-2 are transfected with Wild-type or variant El a 

as described in Example 1. The Bcl-2 expressing cells are resistant to apoptosis induction 
induced by El a. Transactivation by El a is determined as described in Example 1. 

D. Example 4: Cloning and expression of human RB 

[00161] The cDNA encoding human RB protein (Genbank sequence Ml 5400; Lee WH, 
Boolcstein R, Hong F, Young LJ, Shew JY, Lee EY (1987) Science. 235:1394-9) is cloned 
between the Sail and Notl sites in the expression vector pCI-neo. PCR primers used are: 
Primer. 10: CCCGTCGACGCCGCCACCATGCCGCCCA AAACCCCCCG (SEQ ID NO: 16) 
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Primer 11: CCCGCGGCCGCCGGTCCTGAGA TCCTCATTTC (SEQ ID NO: 17) 

[00162] The sequence of the constructed human RB coding sequence (Sail - Notl 
fragment; SEQ ID NO: 18) is shown in Figure 7. CHO-S cells are transfected as described in 
Example 1 and RB-transfected cells are detected by Western blotting using antibody XZ55 
(Pharmingen). 

E. Example 5: Interaction of Ela and RB in the Periplasm ofE. coli 

[00163] The bait fragment, is subcloned by ligating an EcoRI-MunI fragment (amino acids 
379 to 835) of Rb into the EcoRI site of pAS2 (Clontech). This fragment of Rb spans the A-B 
and C pockets of Rb that are necessary and sufficient for binding to Ela. The RB fragment is 
then cloned in-frame with a beta lactamase protein. Ela proteins are expressed in E. coli in 
fimctional form as described by Ferguson et al (1985) Mol Cell Biol. 5:2653-61). Ela - BLIP 
fusion proteins are constructed by insertion of Ela fragments N-terminal to BLIP. For this 
purpose, a fragment of the Ela coding sequence with the intron and CR3 deleted, is used. Thus 
the single Smal site can be used to define the C-terminus of the Ela fragment. Fragments of Ela 
are inserted in frame N-terminal to a BLIP fragment in an E. coli expression vector. 
[00164] On co-expression in E. coli, the ftision proteins are generated in the periplasm and 
association of Ela and RB will lead to inhibition of beta-lactamase activity. The bacteria are 
therefore sensitive to ampicillin. Mutants of Ela defective in RB association will confer 
resistance to ampicillin by release of BLIP from beta-lactamase. Variant clones are characterized 
by DNA sequencing. 

F. Example 6 - Cloning and Expression of a Mutant CREB 

[00165] Human cAMP response-element binding protein (CREB) exists in two forms, a 
longer form denoted CREB-A and a shorter altemative splice variant, CREB-B (Berkowitz and 
Oilman (1990) Proc. Natl. Acad. Sci. 87: 5258-5262). The cDNA encoding human CREB-A is 
cloned using PCR for insertion between EcoRl and Sail sites with the following primers: 

Primer 12: CCCGAATTCGCCGCCACCATGACCATGGACTCTGGAGCAGACA (SEQ ID NO: 22) 

Primer 13: GTCGACCCAAATTAATCTGATTTGTGGCAG (SEQ ID NO: 23) 

[00166] A variant CREB with constitutive transactivation activity (Du et al (2000) Mol. 

Cell. Biol. 20: 4320-4327) is generated by changing the Tyrosine residue at position 134 to 
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Phenylalanine (Y134F) by site-directed mutagenesis according to standard procedures. A single 
point mutation at nucleotide 374 in the sequence shown in Figure 8 (changing A to T) generates 
the Y134F mutation. 

[00167] The variant CREB coding sequence (Figure 8) can be cloned between EcoRl and 
Sal sites in the pCLieo expression vector and introduced into mammalian cells as described in 
Example 1. hi order to protect host cells from apoptosis induced by overexpression of CREB or 
variant CREB, a gene encoding a protective protein may also be introduced into the host cell as 
described in Examples 2 and 3. 

[00168] All pubUcations and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method and 
system of the invention will be apparent to those skilled in the art without departing fi-om the 
scope and spirit of the invention. Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the invention as described should 
not be unduly limited to such specific embodiments. 
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V. Embodiments of the Invention : 

Embodiment A: A vector system for introducing nucleic acids into a mammalian host cell 
comprising: 

(a) a first cistron encoding a transactivator under control of a first promoter; 

(b) a second cistron encoding an apoptosis-protective protein; 

(c) a third cistron encoding at least one desired polypeptide xmder control of a second 
promoter, wherein said second promoter is responsive to the transactivator protein; and 

wherein the first, second, and third cistrons are contained in one or more vectors. 

. • The vector system of embodiment A, further comprising a fourth cistron encoding at least 
one additional desired polypeptide under control of a third promoter, wherein said third 
promoter is responsive to the transactivator protein. 

• The vector of embodiment A, further comprising a fourth cistron encoding at least one 
additional desired polypeptide under control of the second promoter, wherein the third 
cistron and fourth cistron are separated by an intemal ribosome entry site (IRES) element. 

• The vector system of embodiment A, wherein the first cistron and second cistron are 
under control of the same promoter. 

• The vector system of embodiment A, wherein the third and fourth cistron encode a heavy 
and light chain of an antibody or antibody fragment. 

• The vector system of embodiment A wherein the first cistron encodes a CREB or a 
variant thereof. 

• The vector system of embodiment A, wherein the first cistron is an adenoviral Ela 
cistron or a variant thereof, particularly wherein the adenoviral Ela cistron is from human 
Ad2, Ad5 orAdl2. 

• The vector system of embodiment A, wherein the first promoter is a RSV-LTR promoter 
or a cytomegalovirus promoter. 

• The vector system of embodiment A, wherein the second promoter comprises a CREB- 
binding element or a 19bp repeat from the hCMV-MIE enhancer. 
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• The vector system of embodiment A, wherein the second promoter comprises a TATAA 
transcription initiation signal, particularly a TATAA box region of the hCMV-MIE 
promoter. 

• The vector of embodiment A, wherein the third cistron encodes a cytokine, a growth 
hormone, or a single-chain antibody. 



Embodiment B: A method of enhancing the yield of a desired protein in a mammalian host cell 
comprising introducing to the mammalian host cell: 

(a) a first cistron encoding a transactivator xmder control of a first promoter; 

(b) a second cistron encoding an apoptosis-protective protein; and 

(c) a third cistron encoding a desired polypeptide under control of a second promoter, 
wherein said second promoter is responsive to the transactivator protein. 

• The method of embodiment B, wherein expression of the transactivator protein under 
control of the first promoter in the absence of the apoptosis-protective protein would 
cause significant cell death and expression of the apoptosis-protective protein prevents 
cell-killing due to expression of the transactivator. 

• The method of embodiment B, wherein the mammahan host cell is selected firom the 
group consisting of a CHO cell, a mouse myeloma cell, a mouse hybridoma cell, a rat 
myeloma cell, and a rat hybridoma cell. 

• The method of embodiment B, wherein the first, second and third cistrons are on one 
vector. 

• The method of embodiment B, wherein the first, second and third cistrons are introduced 
into the host cell on separate vectors. 

• The method of embodiment B, wherein the apoptosis-protective protein is selected fi-om 
the group consisting of a dominant negative mutant of p53, a protein that interacts with 
BAX, a protein that interacts with BAK, an adenovirus ElB 19K protein, an inhibitor of 
apoptosome formation, and a downstream apoptosis inhibitor. 
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• The method of embodiment B, wherein the apoptosis-protective protein is a variant Bcl-2 
protein having a deletion in the regulatory loop domain. 

• The method of embodiment B, wherein the first, second or third cistron is associated with 
a ubiquitous chromatin opening element, an insulator, a barrier element, an intron, a 
polyadenylation signal, a 5 '-untranslated region or a signal peptide, 

• The method of embodiment B, wherein the transactivator is an El a protein, a CREB 
protein, or a variant thereof. 

Embodiment C: A mammaUan host cell comprising a first cistron encoding a transactivator, a 
second cistron encoding an apoptosis-protective protein that prevents cell-killing due to 
expression of the transactivator, and a third cistron encoding one or more desired proteins under 
the control of a promoter responsive to the transactivator. 

• The mammalian host cell of embodiment C, wherein the mammalian host cell is not a 
human host cell. 

• The mammalian host cell of embodiment C, wherein the transactivator is expressed firom 
an efficient heterologous promoter at a level at which, in the absence of the protective 
protein, significant cell death would occur. 

• A method for producing a recombinant protein comprising cultxuing the mammalian host 
cell of embodiment C in a suitable medium such that the one or more desired proteins are 
secreted into the medium. 

Embodiment D: A mammalian host cell comprising a cistron encoding a variant El a protein and 
a cistron encoding at least one recombinant protein under the control of an activatable promoter, 
wherein the variant El a protein retains transactivation activity but is defective in the ability to 
trigger apoptosis. 
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• The mammalian host cell of embodiment D wherein the El a protein activates 
transcription from a promoter responsive to Ela-289 and is defective in binding to 
retinoblastoma protein. 

• The mammalian host cell of embodiment D further comprising a cistron that encodes an 
apoptosis-protective protein. 

Embodiment E: A method of expressing a desired polypeptide in a manmiaUan host cell 
comprising introducing to the mammalian host cell: 

(a) a first cistron encoding a variant El a protein under control of a first promoter, wherein 
the El a protein that retains transactivation activity but is defective in the ability to trigger 

apoptosis; and 

(b) a second cistron encoding a desired polypeptide under control of a second promoter, 
wherein said second promoter is responsive to the transactivator protein. 

• The method of embodiment E, further comprising introducing a third cistron encoding an 
apoptosis-protective protein to the host cell. 
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ABSTRACT 



The present invention is directed generally to compositions and methods for enhancing 
expression of desired proteins by a mammalian host cell using a co-expressed transcriptional 
activator. In particular, the invention provides vectors, host cells and methods of expressing at 
least one desired polypeptide by transfecting a mammalian host cell with cistrons encoding a 
transactivator, a desired polypeptide and an apoptosis-protective protein. The invention also 
provides vectors, host cells and methods of expressing desired polypeptides by stably 
transfecting a mammalian host cell with a cistron encoding a variant El a protein that retains 
transactivation activity but is defective in the abiUty to trigger apoptosis. 
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MAQAGRTGYDNREIVMKYIHYKLSQRGYEWDVGDVDAAPLGAA 
PTPGIFSFQPESNPTPAVHRDMAARTSPLRPIVATTGPTLSPV 
PPWHLTLRRAGDDFSRRYRRDFAEMSSQLHLTPFTARGRFAT 
WEELFRDGVNWGRIVAFFEFGGVMCVESVNREMSPLVDNIAL 
WMTEYLNRHLHTWIQDNGGWDAFVELYGPSVRPLFDFSWLSLK 
TLLSLALVGACITLGTYLGHK (SEQ ID NO: 1) 



FIG. 1 



MAQAGRTGYDNRE I VMKYIHYKLSQRGYEWDVGDVDAAAAAAS 
PVPPWHLTLRRAGDDFSRRYRRDFAEMSSQLHLTPFTARGRF 
ATWEELFRDGVNWGRIVAFFEFGGVMCVESVNREMSPLVDNI 
ALWMTEYLNRHLHTWIQDNGGWDAFVELYGPSVRPLFDFSWLS 
LKTLLSLALVGACITLGTYLGHK (SEQ ID NO: 2) 



FIG. 2 



1 ATGGCTCAAG CTGGGAGAAC AGGGTATGAT AACCGAGAGA TCGTGATGAA 

51 GTACATCCAT TATAAGCTGT CACAGAGGGG CTACGAGTGG GATGTGGGAG 

101 ATGTGGACGC CGCGGCCGCG GCCGCGAGCC CCGTGCCACC TGTGGTCCAC 

151 CTGACCCTCC GCCGGGCTGG GGATGACTTC TCCCGTCGCT ACCGTCGCGA 

201 CTTCGCGGAG ATGTCCAGTC AGCTGCACCT GACGCCCTTC ACCGCGAGGG 

251 GACGCTTTGC TACGGTGGTG GAGGAACTCT TCAGGGATGG GGTGAACTGG 

301 GGGAGGATTG TGGCCTTCTT TGAGTTCGGT GGGGTCATGT GTGTGGAGAG 

351 CGTCAACAGG GAGATGTCAC CCCTGGTGGA CAACATCGCC CTGTGGATGA 

4 01 CCGAGTACCT GAACCGGCAT CTGCACACCT GGATCCAGGA TAACGGAGGC 

451 TGGGACGCAT TTGTGGAACT GTACGGCCCC AGTGTGAGGC CTCTGTTTGA 

501 TTTCTCTTGG CTGTCTCTGA AGACCCTGCT CAGCCTGGCC CTGGTCGGGG 

551 CCTGCATCAC TCTGGGTACC TACCTGGGCC ACAAGTGA (SEQ ID NO: 3) 



FIG. 3 



MRHIICHGGVITEEMAASLLDQLIEEVLADNLPPPSHFEPPTL 
HELHDLDVTAPEDPNEEAVSQIFPDSVMLAVQEGIDLLTFPPA 
PGSPEPPHLSRQPEQPEQRALGPVSMPNLVPEVIDLTGHEAGF 
PPSDDEDEEGEEFVLDYVEHPGHGCRSCHYHRRNTGDPDIMCS 

LCYMRTCGMFVYSPVSEPEPEPEPEPEPARPTRRPKMAPAILR 
RPTSPVSRECNSSTDSCDSGPSNTPPEIHPWPLCPIKPVAVR 
VGGRRQAVECIEDLLNEPGQPLDLSCKRPRP (SEQ ID NO: 4) 
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1 ATGAGACATA TTATCTGCCA CGGAGGTGTT ATTACCGAAG AAATGGCCGC 
51 CAGTCTTTTG GACCAGCTGA TCGAAGAGGT ACTGGCTGAT AATCTTCCAC 
101 CTCCTAGCCA TTTTGAACCA CCTACCCTTC ACGAACTGCA TGATTTAGAC 
151 GTGACGGCCC CCGAAGATCC CAACGAGGAG GCGGTTTCGC AGATTTTTCC 
201 CGACTCTGTA ATGTTGGCGG TGCAGGAAGG GATTGACTTA CTCACTTTTC 
251 CGCCGGCGCC CGGTTCTCCG GAGCCGCCTC ACCTTTCCCG GCAGCCCGAG 
301 CAGCCGGAGC AGAGAGCCTT GGGTCCGGTT TCTATGCCAA ACCTTGTACC 
351 GGAGGTGATC GATCTTACCG GCCACGAGG.C TGGCTTTCCA CCCAGTGACG 
401 ACGAGGATGA AGAGGGTGAG GAGTTTGTGT TAGATTATGT GGAGCACCCC 
451 GGGCACGGTT GCAGGTCTTG TCATTATCAC CGGAGGAATA CGGGGGACCC 
501 AGATATTATG TGTTCGCTTT GCTATATGAG GACCTGTGGC ATGTTTGTCT 
551 ACAGTAAGTG AAAATTATGG GCAGTGGGTG ATAGAGTGGT GGGTTTGGTG 
601 TGGTAATTTT TTTTTTAATT TTTACAGTTT TGTGGTTTAA AGAATTTTGT ■ 
651 ATTGTGATTT TTTTAAAAGG TCCTGTGTCT GAACCTGAGC CTGAGCCCGA 
701 GCCAGAACCG GAGCCTGCAA GACCTACCCG CCGTCCTAAA ATGGCGCCTG 
751 CTATCCTGAG ACGCCCGACA TCACCTGTGT CTAGAGAATG CAATAGTAGT 
801 ACGGATAGCT GTGACTCCGG TCCTTCTAAC ACACCTCCTG AGATACACCC 
851 GGTGGTCCCG CTGTGCCCCA TTAAACCAGT TGCCGTGAGA GTTGGTGGGC 
901 GTCGCCAGGC TGTGGAATGT ATCGAGGACT TGCTTAACGA GCCTGGGCAA 
951 CCTTTGGACT TGAGCTGTAA ACGCCCCAGG CCATAA (SEQ ID NO: 5) 
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1 GAATTCGCCG CCACCATGGA GGCTTGGGAG TGTTTGGAAG ATTTTTCTGC 
51 TGTGCGTAAC TTGCTGGAAC AGAGCTCTAA CAGTACCTCT TGGTTTTGGA 
101 GGTTTCTGTG GGGCTCATCC CAGGCAAAGT TAGTCTGCAG AATTAAGGAG 
151 GATTACAAGT GGGAATTTGA AGAGCTTTTG AAATCCTGTG GTGAGCTGTT 
201 TGATTCTTTG AATCTGGGTC ACCAGGCGCT TTTCCAAGAG AAGGTCATCA 
251 AGACTTTGGA TTTTTCCACA CCGGGGCGCG CTGCGGCTGC TGTTGCTTTT 
301 TTGAGTTTTA TAAAGGATAA ATGGAGCGAA GAAACCCATC TGAGCGGGGG 
351 GTACCTGCTG GATTTTCTGG CCATGCATCT GTGGAGAGCG GTTGTGAGAC 
401 ACAAGAATCG CCTGCTACTG TTGTCTTCCG TCCGCCCGGC GATAATACCG 
451 ACGGAGGAGC AGCAGCAGCA GCAGGAGGAA GCCAGGCGGC GGCGGCAGGA 
501 GCAGAGCCCA TGGAACCCGA GAGCCGGCCT GGACCCTCGG GAATGAATGT 
551 TGGTCGAC (SEQ ID NO: 15) 
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Sail 

1 GTCGACGCCG CCACCATGCC GCCCAAAACC CCCCGAAAAA CGGCCGCCAC 

MP PKT PRKT AAT 

51 CGCCGCCGCT GCCGCCGCGG AACCCCCGGC ACCGCCGCCG CCGCCCCCTC 
AAA AAAE PPA PPP PPPP 

101 CTGAGGAGGA CCCAGAGCAG GACAGCGGCC CGGAGGACCT GCCTCTCGTC 
EED PEQ DSGP EDL PLV 

151 AGGCTTGAGT TTGAAGAAAC AGAAGAACCT GATTTTACTG CATTATGTCA 
RLEF EET EEP DFTA LCQ 

201 GAAATTAAAG ATACCAGATC ATGTCAGAGA GAGAGCTTGG TTAACTTGGG 
KLK IPDH VRE RAW LTWE 

251 AGAAAGTTTC ATCTGTGGAT GGAGTATTGG GAGGTTATAT TCAAAAGAAA 
KVS SVD GVLG GYI QKK 

301 AAGGAACTGT GGGGAATCTG TATCTTTATT GCACGAGTTG ACCTAGATGA 
KELW GIC IFI ARVD LDE 

351 GATGTCGTTC ACTTTACTGA GCTACAGAAA AACATACGAA ATCAGTGTCC 
MSP TLLS YRK TYE ISVH 

401 ATAAATTCTT TAACTTACTA AAAGAAATTG ATACCAGTAC CAAAGTTGAT 
KFF NLL KEID TST KVD 

451 AATGCTATGT CAAGACTGTT GAAGAAGTAT GATGTATTGT TTGCACTCTT 
NAMS RLL KKY DVLF ALF 

501 CAGCAAATTG GAAAGGACAT GTGAACTTAT ATATTTGACA CAACCCAGCA 
SKL ERTC ELI YLT QPSS 

551 GTTCGATATC TACTGAAATA AATTCTGCAT TGGTGCTAAA AGTTTCTTGG 
S IS TEI NSAL VLK VSW 

601 ATCACATTTT TATTAGCTAA AGGGGAAGTA TTACAAATGG AAGATGATCT 
ITFL LAK GEV LQME DDL 

651 GGTGATTTCA TTTCAGTTAA TGCTATGTGT CCTTGACTAT TTTATTAAAC 
VIS FQLM LCV LD.Y FIKL 
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701 TCTCACCTCC CATGTTGCTC AAAGAACCAT ATAAAACAGC TGTTATACCC 
SPP MLL KEPY KTA VIP 

751 ATTAATGGTT CACCTCGAAC ACCCAGGCGA GGTCAGAACA GGAGTGCACG 
INGS PRT PRR GQNR S AR 

801 GATAGCAAAA CAACTAGAAA ATGATACAAG AATTATTGAA GTTCTCTGTA 
lAK QLEN DTK IIE V LCK 

851 AAGAACATGA ATGTAATATA GATGAGGTGA AAAATGTTTA TTTCAAAAAT 
EHE CNI DEVK NVY FKN 

EcoRI 

901 TTTATACCTT TTATGAATTC TCTTGGACTT GTAACATCTA ATGGACTTCC 
FIPF MNS LGL VTSN GLP 

■951 AGAGGTTGAA AATCTTTCTA AACGATACGA AGAAATTTAT CTTAAAAATA 
EVE NLSK RYE EIY LKNK 

1001 AAGATCTAGA TCGAAGATTA TTTTTGGATC ATGATAAAAC TCTTCAGACT 
DLD RRL FLDH DKT LQT 

1051 GATTCTATAG ACAGTTTTGA AACACAGAGA ACACCACGAA AAAGTAACCT 
DSID SFE TQR TPRK SNL 

1101 TGATGAAGAG GTGAATATAA TTCCTCCACA CACTCCAGTT AGGACTGTTA 
DEE VNII PPH TPV RTVM 

1151 TGAACACTAT CCAACAATTA ATGATGATTT TAAATTCTGC AAGTGATCAA 
NTI QQL MMIL NSA SDQ 

1201 CCTTCAGAAA ATCTGATTTC CTATTTTAAC AACTGCACAG TGAATCCAAA 
PSEN LIS YFN NCTV NP K 

1251 AGAAAGTATA CTGAAAAGAG TGAAGGATAT AGGATACATC TTTAAAGAGA 
ESI LKRV KDI GYI FKEK 

1301 AATTTGCTAA AGCTGTGGGA CAGGGTTGTG TCGAAATTGG ATCACAGCGA 
FAK AVG QGCV EIG SQR 

1351 TACAAACTTG GAGTTCGCTT GTATTACCGA GTAATGGAAT CCATGCTTAA 
YKLG VRL YYR VMES MLK 
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1401 ATCAGAAGAA GAACGATTAT CCATTCAAAA TTTTAGCAAA CTTCTGAATG 
SEE ERLS I QN FSK LLND 

1451 ACAACATTTT TCATATGTCT TTATTGGCGT GCGCTCTTGA GGTTGTAATG 
NIF HMS LLAC AL E VVM 

1501 GCCACATATA GCAGAAGTAC ATCTCAGAAT CTTGATTCTG GAACAGATTT 
ATYS RST SQN LDSG TDL 

1551 GTCTTTCCCA TGGATTCTGA ATGTGCTTAA TTTAAAAGCC TTTGATTTTT 
SFP WILN VLN LKA FDFY 

1601 ACAAAGTGAT CGAAAGTTTT ATCAAAGCAG AAGGCAACTT GACAAGAGAA 
KVI ESF IKAE GN L TRE 

1651 ATGATAAAAC ATTTAGAACG ATGTGAACAT CGAATCATGG AATCCCTTGC 
MIKH LER CEH RIME SLA 

1701 ATGGCTCTCA GATTCACCTT TATTTGATCT TATTAAACAA TCAAAGGACC 
WLS DSPL FDL IKQ SKDR 

1751 GAGAAGGACC AACTGATCAC CTTGAATCTG CTTGTCCTCT TAATCTTCCT 
EGP TDH LESA CPL NLP 

1801 CTCCAGAATA ATCACACTGC AGCAGATATG TATCTTTCTC CTGTAAGATC 
LQNN HTA ADM YLSP VRS 

1851 TCCAAAGAAA AAAGGTTCAA CTACGCGTGT AAATTCTACT GCAAATGCAG 
PKK KGST TR V NST ANAE 

1901 AGACACAAGC AACCTCAGCC TTCCAGACCC AGAAGCCATT GAAATCTACC 
TQA TSA FQTQ KPL KST 

1951 TCTCTTTCAC TGTTTTATAA AAAAGTGTAT CGGCTAGCCT ATCTCCGGCT 
SLSL FYK KVY RLAY LRL 

2001 AAATACACTT TGTGAACGCC TTCTGTCTGA GCACCCAGAA TTAGAACATA 
NTL CERL LSE HPE LEHI 

2051 TCATCTGGAC CCTTTTCCAG CACACCCTGC AGAATGAGTA TGAACTCATG 
IWT.LFQ HTLQ NEY ELM 
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2101 AGAGACAGGC ATTTGGACCA AATTATGATG TGTTCCATGT ATGGCATATG 
RDRH LDQ IMM CSMY GIC 

2151 CAAAGTGAAG AATATAGACC TTAAATTCAA AATCATTGTA ACAGCATACA 
KVK NIDL KFK IIV TAYK 



2201 AGGATCTTCC TCATGCTGTT CAGGAGACAT TCAAACGTGT TTTGATCAAA 
DLP HAV QETF KRV LIK 

2251 GAAGAGGAGT ATGATTCTAT TATAGTATTC TATAACTCGG TCTTCATGCA 
EEEY DSI IVF YNSV FM Q 



23 01 GAGACTGAAA ACAAATATTT TGCAGTATGC TTCCACCAGG CCCCCTACCT 
RLK TNIL QYA STR PPTL 

2351 TGTCACCAAT ACCTCACATT CCTCGAAGCC CTTACAAGTT TCCTAGTTCA 
SPI PHI PRSP YKF PSS 



2401 CCCTTACGGA TTCCTGGAGG GAACATCTAT ATTTCACCCC TGAAGAGTCC 
PLRI PGG NIY ISPL KSP 



2451 ATATAAAATT TCAGAAGGTC TGCCAACACC AACAAAAATG ACTCCAAGAT 
YKI SEGL PTP TKM TPRS 



2501 CAAGAATCTT AGTATCAATT GGTGAATCAT TCGGGACTTC TGAGAAGTTC 
RIL VSI GESF GTS EKF 



2551 CAGAAAATAA ATCAGATGGT ATGTAACAGC GACCGTGTGC TCAAAAGAAG 
QKIN QMV CNS DRVL KRS 

2601 TGCTGAAGGA AGCAACCCTC CTAAACCACT GAAAAAACTA CGCTTTGATA 
AEG SNPP KPL KKL RFDI 



2 651 TTGAAGGATC AGATGAAGCA GATGGAAGTA AACATCTCCC AGGAGAGTCC 
EGS DEA DGSK HLP GES 

2701 AAATTTCAGC AGAAACTGGC AGAAATGACT TCTACTCGAA CACGAATGCA 
KFQQ KLA EMT STRT RMQ 

2751 AAAGCAGAAA ATGAATGATA GCATGGATAC CTCAAACAAG GAAGAGAAAT 
KQK MNDS MDT SNK EEK* 



Not I 

2801 GAGGATCTC7V GGACCGGCGG CCGC 
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EcoRI 

1 GAATTCGCCG CCACCATGAC CATGGACTCT GGAGCAGACA ACCAGCAGAG 

MT MDS GADN QQS 

51 TGGAGATGCA GCTGTAACAG AAGCTGAAAA CCAACAAATG ACAGTTCAAG 
GDA AVTE AEN QQM TVQA 

101 CCCAGCCACA GATTGCCACA TTAGCCCAGG TATCTATGCC AGCAGCTCAT 
QPQ lAT LAQV SMP AAH 

151 GCAACATCAT CTGCTCCCAC CGTAACTCTA GTACAGCTGC CCAATGGGCA 
ATSS APT VTL VQLP NGQ 

201 GACAGTTCAA GTCCATGGAG TCATTCAGGC GGCCCAGCCA TCAGTTATTC 
TVQ VHGV IQA AQP SVIQ 

251 AGTCTCCACA AGTCCAAACA GTTCAGATTT CAACTATTGC AGAAAGTGAA 
SPQ VQT VQIS TIA ESE 

301 GATTCACAGG AGTCAGTGGA TAGTGTAACT GATTCCCAAA AGCGAAGGGA 
DSQE SVD SVT DSQK RRE 

351 AATTCTTTCA AGGAGGCCTT CCTTCAGGAA AATTTTGAAT GACTTATCTT 
ILS RRPS PRK ILN DLSS 

401 CTGATGCACC AGGAGTGCCA AGGATTGAAG AAGAGAAGTC TGAAGAGGAG 
DAP GVP RIEE EKS EEE 

451 GCTTCAGCAC CTGCCATCAC CGCTGTAGCG GTGCCAACGC CAATTTACCG 
ASAP AIT AVA VPTP lYR 

501 GACTAGCAGT GGACAGTATA TTACCATTAC CCAGAGAGGA GCAATACAGC 
TSS GQYI TIT QRG AIQL 

551 TGGCTAGCAA TGGTACCGAT GGGGTACAGG GCCTGCAAAC ATTAACCATG 
ASN GTD GVQG LQT LTM 

601 GCCAATGCAG CAGCCACTCA GCCGGGTACT ACCATTCTAC AGTATGCACA 
ANAA ATQ PGT TILQ YAQ 

651 GACCACTGAT GGACAGCAGA TCTTAGTGCC CAGCAACCAA GTTGTTGTTC 
TTD GQQI LVP SNQ VVVQ 
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701 AAGCTGCCTC TGGAGACGTA CAAACATACC AGATTCGCAC AGCACCCACT 
AAS GDV QTYQ IRT APT 

751 AGCACTATTG CCCCTGGAGT TGTTATGGCA TCCTCCCCAG CACTTCCTAC 
STIA PGV VMA SSPA LPT 

801 ACAGCCTGCT GAAGAAGCAG CACGAAAGAG ' AGAGGTCCGT CTAATGAAGA 
QPA EEAA RKR EVR LMKN 

851 ACAGGGAAGC AGCTGGTGAG TGTCGTAGAA AGAAGAAAGA ATATGTGAAA 
REA ARE CRRK KKE YVK 

901 TGTTTAGAAA ACAGAGTGGC AGTGCTTGAA AATCAAAACA AGACATTGAT 
G LEN RVA VL E NQ NK TLI 

951 TGAGGAGCTA AAAGCACTTA AGGACCTTTA CTGCCACAAA TCAGATTAAT 
EEL KALK DLY CHK SD* 

Sail 
1001 TTGGGTCGAC 
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SEQUENCE LISTING 



<110> Celscia Therapeutics, Inc. 

<120> TRANSACTIVATION SYSTEM FOR MAMMALIAN CELLS 

<130> CELA-OOl/OOUS 

<160> 23 

<170> Patentin version 3.2 

<210> 1 

<211> 236 

<212> PRT 

<213> hamster 

<400> 1 

Met Ala Gin Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu lie Val Met 
15 10 15 



Lys Tyr lie His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Val 
20 25 30 



Gly Asp Val Asp Ala Ala Pro Leu Gly Ala Ala Pro Thr Pro Gly lie 
35 40 45 



Phe Ser Phe Gin Pro Glu Ser Asn Pro Thr Pro Ala Val His Arg Asp 
50 55 60 



Met Ala Ala Arg Thr Ser Pro Leu Arg Pro lie Val Ala Thr 'Thr Gly 
65 70 75 80 



Pro Thr Leu Ser Pro Val Pro Pro Val Val His Leu Thr Leu Arg Arg 
85 90 95 



Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Arg Asp Phe Ala Glu Met 
100 105 110 



Ser Ser Gin Leu His Leu Thr Pro Phe Thr Ala Arg Gly Arg Phe Ala 
115 120 125 



Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp Gly Arg lie 
130 135 140 



Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu Ser Val Asn 
145 150 155 160 
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Arg Glu Met Ser Pro Leu Val Asp Asn lie Ala Leu Trp Met Thr Glu 
165 170 175 



Tyr Leu Asn. Arg His Leu His Thr Trp lie Gin Asp Asn Gly Gly Trp 
180 185 190 



Asp Ala Phe Val Glu Leu Tyr Gly Pro Ser Val Arg Pro Leu Phe Asp 
195 200 205 



Phe Ser Trp Leu Ser Leu Lys Thr Leu Leu Ser Leu Ala Leu Val Gly 
210 215 220 



Ala Cys lie Thr Leu Gly Thr Tyr Leu Gly His Lys 
225 230 235 



<210> 2 

<211> 195 

<212> PRT 

<213> hamster 

<400> 2 

Met Ala Gin Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu lie Val Met 
1 5 10 . 15 



Lys Tyr lie His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Val 
20 25 30 



Gly Asp Val Asp Ala Ala Ala Ala Ala Ala Ser Pro Val Pro Pro Val 
35 40 45 



Val His Leu Thr Leu Arg Arg Ala Gly Asp Asp Phe Ser Arg Arg Tyr 
50 55 60 



Arg Arg Asp Phe Ala Glu Met Ser Ser Gin Leu His Leu Thr Pro Phe 
65 70 75 80 



Thr Ala Arg Gly Arg Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp 
85 90 95 



Gly Val Asn Trp Gly Arg He Val Ala Phe Phe Glu Phe Gly Gly Val 
100 105 110 
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Met Cys Val Glu Ser Val Asn Arg Glu Met Ser Pro Leu Val Asp Asn 
115 120 125 



lie Ala Leu Trp Met Thr Glu Tyr Leu Asn Arg His Leu His Thr Trp 
130 135 140 



lie Gin Asp Asn Gly Gly Trp Asp Ala Phe Val Glu Leu Tyr Gly Pro 
145 150 155 160 



Ser Val Arg Pro Leu Phe Asp Phe Ser Trp Leu Ser Leu Lys Thr Leu 
165 170 175 



Leu Ser Leu Ala Leu Val Gly Ala Cys lie Thr Leu Gly Thr Tyr Leu 
180 185 190 



Gly His Lys 
195 



<210> 3 

<211> 588 

<212> DNA 

<213> hamster 

<400> 3 



atggctcaag 


ctgggagaac 


agggtatgat 


aaccgagaga 


tcgtgatgaa 


gtacatccat 


60 


tataagctgt 


cacagagggg 


ctacgagtgg 


gatgtgggag 


atgtggacgc 


cgcggccgcg 


120 


gccgcgagcc 


ccgtgccacc 


tgtggtccac 


ctgaccctcc 


gccgggctgg 


ggatgacttc 


180 


tcccgtcgct 


accgtcgcga 


cttcgcggag 


atgtccagtc 


agctgcacct 


gacgcccttc 


240 


accgcgaggg 


gacgctttgc 


tacggtggtg 


gaggaactct 


tcagggatgg 


ggtgaactgg 


300 


gggaggattg 


tggccttctt 


tgagttcggt 


ggggtcatgt 


gtgtggagag 


cgtcaacagg 


360 


gagatgtcac 


ccctggtgga 


caacatcgcc 


ctgtggatga 


ccgagtacct 


gaaccggcat 


420 


ctgcacacct 


ggatccagga 


taacggaggc 


tgggacgcat 


ttgtggaact 


gtacggcccc 


480 


agtgtgaggc 


ctctgtttga 


tttctcttgg 


ctgtctctga 


agaccctgct 


cagcctggcc 


540 


ctggtcgggg 


cctgcatcac 


tctgggtacc 


tacctgggcc 


acaagtga 
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<210> 4 

<211> 289 

<212> PRT 

<213> Human adenovirus type 5 

<400> 4 



Met Arg His He He Cys His Gly Gly Val He Thr Glu Glu Met Ala 
15 10 15 



Ala Ser Leu Leu Asp Gin Leu He Glu Glu Val Leu Ala Asp Asn Leu 
20 25 30 



Pro Pro Pro Ser His Phe Glu Pro Pro Thr Leu His Glu Leu His Asp 
35 40 45 



Leu Asp Val Thr Ala Pro Glu Asp Pro Asn Glu Glu Ala Val Ser Gin 
50 55 60 



He Phe Pro Asp Ser Val Met Leu Ala Val Gin Glu Gly He Asp Leu 
65 70 75 80 



Leu Thr Phe Pro Pro Ala Pro Gly Ser Pro Glu Pro Pro His Leu Ser 
85 90 95 



Arg Gin Pro Glu Gin Pro Glu Gin Arg Ala Leu Gly Pro Val Ser Met 
100 105 110 



Pro Asn Leu Val Pro Glu Val He Asp Leu Thr Gly His Glu Ala Gly 
115 120 125 



Phe Pro Pro Ser Asp Asp Glu Asp Glu Glu Gly Glu Glu Phe Val Leu 
130 135 140 



Asp Tyr Val Glu His Pro Gly His Gly Cys Arg Ser Cys His Tyr His 
145 150 155 160 



Arg Arg Asn Thr Gly Asp Pro Asp He Met Cys Ser Leu Cys Tyr Met 
165 170 175 



Arg Thr Cys Gly Met Phe Val Tyr Ser Pro Val Ser Glu Pro Glu Pro 
180 185 190 



Glu Pro Glu Pro Glu Pro Glu Pro Ala Arg Pro Thr Arg Arg Pro Lys 
195 200 205 



Met Ala Pro Ala He Leu Arg Arg Pro Thr Ser Pro Val Ser Arg Glu 
210 215 220 
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Cys Asn Ser Ser Thr Asp Ser Cys Asp Ser Gly 
225 230 235 



Pro Ser Asn Thr Pro 
240 



Pro Glu lie His Pro Val Val Pro Leu Cys Pro lie Lys Pro Val Ala 
245 250 255 

Val Arg Val Gly Gly Arg Arg Gin Ala Val Glu Cys lie Glu Asp Leu 
260 265 270 

Leu Asn Glu Pro Gly Gin Pro Leu Asp Leu Ser Cys Lys Arg Pro Arg 
275 280 285 

Pro 

<210> 5 
<211> 986 
<212> DNA 

<213> Human adenovirus type 5 
<400> 5 



atgagacata 


ttatctgcca 


cggaggtgtt 


attaccgaag 


aaatggccgc 


cagtcttttg 


60 


gaccagctga 


tcgaagaggt 


actggctgat 


aatcttccac 


ctcctagcca 


ttttgaacca 


120 


cctacccttc 


acgaactgca 


tgatttagac 


gtgacggccc 


ccgaagatcc 


caacgaggag 


180 


gcggtttcgc 


agatttttcc 


cgactctgta 


atgttggcgg 


tgcaggaagg 


gattgactta 


240 


ctcacttttc 


cgccggcgcc 


cggttctccg 


gagccgcctc 


acctttcccg 


gcagcccgag 


300 


cagccggagc 


agagagcctt 


gggtccggtt 


tctatgccaa 


accttgtacc 


ggaggtgatc 


360 


gatcttaccg 


gccacgaggc 


tggctttcca 


cccagtgacg 


acgaggatga 


agagggtgag 


420 


gagtttgtgt 


tagattatgt 


ggagcacccc 


gggcacggtt 


gcaggtcttg 


tcattatcac 


480 


cggaggaata 


cgggggaccc 


agatattatg 


tgttcgcttt 


gctatatgag 


gacctgtggc 


540 


atgtttgtct 


acagtaagtg 


aaaattatgg 


gcagtgggtg 


atagagtggt 


gggtttggtg 


600 


tggtaatttt 


ttttttaatt 


tttacagttt 


tgtggtttaa 


agaattttgt 


attgtgattt 


660 


ttttaaaagg 


tcctgtgtct 


gaacctgagc 


ctgagcccga 


gccagaaccg 


gagcctgcaa 


720 


gacctacccg 


ccgtcctaaa 


atggcgcctg 


ctatcctgag 


acgcccgaca 


tcacctgtgt 


780 


ctagagaatg 


caatagtagt 


acggatagct 


gtgactccgg 


tccttctaac 


acacctcctg 


840 


agatacaccc 


ggtggtcccg 


ctgtgcccca 


ttaaaccagt 


tgccgtgaga 


gttggtgggc 


900 


gtcgccaggc 


tgtggaatgt 


atcgaggact 


tgcttaacga 


gcctgggcaa 


cctttggact 


960 
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tgagctgtaa acgccccagg ccataa 



<210> 6 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 6 

cccgaattcg ccgccaccat gagacatatt atctgccac 



<210> 7 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 7 

cccgtcgacc ttatggcctg gggcgttt 



<210> 8 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 8 

cccggtaccg ccgccaccat gagacatatt atctgccac 



<210> 9 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 9 

cccgcggccg ccttatggcc tggggcgttt 



<210> 10 

<211> 25 

<212> DNA 

<213> Artificial 
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<220> 

<223> primer 
<400> 10 

ggaggtgatc gatcttaccg gccac 



<210> 11 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 11 

cctacccttc acgaactgca tgatttagac gtgacg 



<210> 12 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 12 

cgtcacgtct aaatcatgca gttcgtgaag ggtagg 

<210> 13 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 13 

cccgaattcg ccgccaccat ggaggcttgg gagtgttt 



<210> 14 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 14 

cccgtcgacc aacattcatt cccgagggt 

<210> 15 

<211> 558 



58 



<212> DNA 

<213> Human adenovirus type 5 
<400> 15 

gaattcgccg ccaccatgga ggcttgggag tgtttggaag atttttctgc tgtgcgtaac 
ttgctggaac agagctctaa cagtacctct tggttttgga ggtttctgtg gggctcatcc 
caggcaaagt tagtctgcag aattaaggag gattacaagt gggaatttga agagcttttg 
aaatcfctgtg gtgagctgtt tgattctttg aatctgggtc accaggcgct tttccaagag 
aaggtcatca agactttgga tttttccaca ccggggcgcg ctgcggctgc tgttgctttt 
ttgagtttta taaaggataa atggagcgaa gaaacccatc tgagcggggg gtacctgctg 
gattttctgg ccatgcatct gtggagagcg gttgtgagac acaagaatcg cctgctactg 
ttgtcttccg tccgcccggc gataataccg acggaggagc agcagcagca gcaggaggaa 
gccaggcggc ggcggcagga gcagagccca tggaacccga gagccggcct ggaccctcgg 
gaatgaatgt tggtcgac 



<210> 16 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 16 

cccgtcgacg ccgccaccat gccgcccaaa accccccg 



<210>'- 17 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 17 

cccgcggccg ccggtcctga gatcctcatt tc 



<210> 18 

<211> 2824 

<212> DNA 

<213> Homo sapiens 

<400> 18 

gtcgacgccg ccaccatgcc gcccaaaacc ccccgaaaaa cggccgccac cgccgccgct 
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gccgccgcgg 


aacccccggc 


accgccgccg 


ccgccccctc 


ctgaggagga 


cccagagcag 


120 


gacagcggcc 


cggaggacct 


gcctctcgtc 


aggcttgagt 


ttgaagaaac 


agaagaacct 


180 


gattttactg 


cattatgtca 


gaaattaaag 


ataccagatc 


atgtcagaga 


gagagcttgg 


240 


ttaacttggg 


agaaagtttc 


atctgtggat 


ggagtattgg 


gaggttatat 


tcaaaagaaa 


300 


aaggaactgt 


ggggaatctg 


tatctttatt 


gcacgagttg 


acctagatga 


gatgtcgttc 


360 


actttactga 


gctacagaaa 


aacatacgaa 


atcagtgtcc 


ataaattctt 


taacttacta 


420 


aaagaaattg 


ataccagtac 


caaagttgat 


aatgctatgt 


caagactgtt 


gaagaagtat 


480 


gatgtattgt 


ttgcactctt 


cagcaaattg 


gaaaggacat 


gtgaacttat 


atatttgaca 


540 


caacccagca 


gttcgatatc 


tactgaaata 


aattctgcat 


tggtgctaaa 


agtttcttgg 


600 


atcacatttt 


tattagctaa 


aggggaagta 


ttacaaatgg 


aagatgatct 


ggtgatttca 


660 


tttcagttaa 


tgctatgtgt 


ccttgactat 


tttattaaac 


tctcacctcc 


catgttgctc 


720 


aaagaaccat 


ataaaacagc 


tgttataccc 


attaatggtt 


cacctcgaac 


acccaggcga 


780 


ggtcagaaca 


ggagtgcacg 


gatagcaaaa 


caactagaaa 


atgatacaag 


aattattgaa 


840 


gttctctgta 


aagaacatga 


atgtaatata 


gatgaggtga 


aaaatgttta 


tttcaaaaat 


900 


tttatacctt 


ttatgaattc 


tcttggactt 


gtaacatcta 


atggacttcc 


agaggttgaa 


960 


aatctttcta 


aacgatacga 


agaaatttat 


cttaaaaata 


aagatctaga 


tcgaagatta 


1020 


tttttggatc 


atgataaaac 


tcttcagact 


gattctatag 


acagttttga 


aacacagaga 


1080 


acaccacgaa 


aaagtaacct 


tgatgaagag 


gtgaatataa 


ttcctccaca 


cactccagtt 


1140 


aggactgtta 


tgaacactat 


ccaacaatta 


atgatgattt 


taaattctgc 


aagtgatcaa 


1200 


ccttcagaaa 


atctgatttc 


ctattttaac 


aactgcacag 


tgaatccaaa 


agaaagtata 


1260 


ctgaaaagag 


tgaaggatat 


aggatacatc 


tttaaagaga 


aatttgctaa 


agctgtggga 


1320 


cagggttgtg 


tcgaaattgg 


atcacagcga 


tacaaacttg 


gagttcgctt 


gtattaccga 


1380 


gtaatggaat 


ccatgcttaa 


atcagaagaa 


gaacgattat 


ccattcaaaa 


ttttagcaaa 


1440 


cttctgaatg 


acaacatttt 


tcatatgtct 


ttattggcgt 


gcgctcttga 


ggttgtaatg 


1500 


gccacatata 


gcagaagtac 


atctcagaat 


cttgattctg 


gaacagattt 


gtctttccca 


1560 


tggattctga 


atgtgcttaa 


tttaaaagcc 


tttgattttt 


acaaagtgat 


cgaaagtttt 


1620 


atcaaagcag 


aaggcaactt 


gacaagagaa 


atgataaaac 


atttagaacg 


atgtgaacat 


1680 


cgaatcatgg 


aatcccttgc 


atggctctca 


gattcacctt 


tatttgatct 


tattaaacaa 


1740 


tcaaaggacc 


gagaaggacc 


aactgatcac 


cttgaatctg 


cttgtcctct 


taatcttcct 


1800 
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tcyyya.CL.tc 


tyrtyaay t tc 


a 1^ a a a a +~ a a 
t ay dadrt trtrt 
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z o o u 
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aaa ^ ^ a ^ ^ 

C tdaaCCdC t 


gaaadddC td 


Z u 


cgctttgata 


ttgaaggatc 


agatgaagca 


gatggaagta 


aacatctccc 


aggagagtcc 


2700 


aaatttcagc 


agaaactggc 


agaaatgact 


tctactcgaa 


cacgaatgca 


aaagcagaaa 


2760 


atgaatgata 


gcatggatac 


ctcaaacaag 


gaagagaaat 


gaggatctca 


ggaccggcgg 


2820 


ccgc 
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<210> 19 

<211> 928 

<212> PRT 

<213> Homo sapiens 

<400> 19. 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 



Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 



Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 



61 



Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 



Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 



Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys Lys Lys 
85 90 95 



Glu Leu Trp Gly He Cys He Phe He Ala Arg Val Asp Leu Asp Glu 
100 105 110 



Met Ser Phe Thr Leu Leu Ser Tyr Arg Lys Thr Tyr Glu He Ser Val 
115 120 125 



His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys Val 
130 135 140 



Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
145 150 155 160 



Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu Thr Gin 
165 170 175 



Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val Leu Lys 
180 185 190 



Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met 
195 200 205 



Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val Leu Asp 
210 215 220 



Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys 
225 230 235 240 



Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly 
245 250 255 



Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg 
260 265 270 



He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val 
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275 



280 



285 



Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn Ser Leu Gly 
290 295 300 



Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg 
305 310 315 320 



Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Arg Arg Leu Phe 
325 330 335 



Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe Glu 
340 345 350 



Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn lie 
355 360 365 



lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie Gin Gin 
370 375 380 



Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu 
385 390 395 400 



lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie Leu 
405 410 415 



Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys Phe Ala Lys 
420 425 430 



Ala Val Gly Gin Gly Cys Val Glu lie Gly Ser Gin Arg Tyr Lys Leu 
435 440 445 



Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu 
450 455 460 



Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn 
465 470 475 480 



lie Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala 
485 490 495 



Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu 
500 505 510 
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Ser Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe 
515 520 525 



Tyr Lys Val lie Glu Ser Phe lie Lys Ala Glu Gly Asn Leu Thr Arg 
530 535 540 



Glu Met lie Lys His Leu Glu Arg Cys Glu His Arg lie Met Glu Ser 
545 550 555 560 



Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys Gin Ser 
565 570 575 



Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu 
580 585 590 



Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser 
595 600 605 



Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser 
610 615 620 



Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys 
625 630 635 640 



Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg 
645 650 655 



Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu 
660 665 670 



His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His Thr Leu 
675 680 685 



Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin lie Met 
690 695 700 



Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie Asp Leu Lys 
705 710 715 720 



Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin 
725 730 735 
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Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr Asp Ser lie 
740 745 750 



lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn lie 
755 760 765 



Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie Pro His 
770 775 780 



He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro 
785 790 795 800 



Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser 
805 810 815 



Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu 
820 825 830 



Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys He 
835 840 845 



Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu 
850 855 860 



Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu 
865 870 875 880 



Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys 
885 890 895 



Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin 
900 905 910 



Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 

920 925 





915 


<210> 


20 


<211> 


1010 


<212> 


DNA 


<213> 


Homo 


<400> 


20 



gaattcgccg ccaccatgac catggactct ggagcagaca accagcagag tggagatgca 
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gctgtaacag 


aagctgaaaa 


ccaacaaatg 


acagttcaag 


cccagccaca 


gattgccaca 


120 


ttagcccagg 


tatctatgcc 


agcagctcat 


gcaacatcat 


ctgctcccac 


cgtaactcta 


180 


gtacagctgc 


ccaatgggca 


gacagttcaa 


gtccatggag 


tcattcaggc 


ggcccagcca 


240 


tcagttattc 


agtctccaca 


agtccaaaca 


gttcagattt 


caactattgc 


agaaagtgaa 


300 


gattcacagg 


agtcagtgga 


tagtgtaact 


gattcccaaa 


agcgaaggga 


aattctttca 


360 


aggaggcctt 


ccttcaggaa 


aattttgaat 


gacttatctt 


ctgatgcacc 


aggagtgcca 


420 


aggattgaag 


aagagaagtc 


tgaagaggag 


gcttcagcac 


ctgccatcac 


cgctgtagcg 


480 


gtgccaacgc 


caatttaccg 


gactagcagt 


ggacagtata 


ttaccattac 


ccagagagga 


540 


gcaatacagc 


tggctagcaa 


tggtaccgat 


ggggtacagg 


gcctgcaaac 


attaaccatg 


600 


gccaatgcag 


cagccactca 


gccgggtact 


accattctac 


agtatgcaca 


gaccactgat 


660 


ggacagcaga 


tcttagtgcc 


cagcaaccaa 


gttgttgttc 


aagctgcctc 


tggagacgta 


720 


caaacatacc 


agattcgcac 


agcacccact 


agcactattg 


cccctggagt 


tgttatggca 


780 


tcctccccag 


cacttcctac 


acagcctgct 


gaagaagcag 


cacgaaagag 


agaggtccgt 


840 


ctaatgaaga 


acagggaagc 


agctcgtgag 


tgtcgtagaa 


agaagaaaga 


atatgtgaaa 


900 


tgtttagaaa 


acagagtggc 


agtgcttgaa 


aatcaaaaca 


agacattgat 


tgaggagcta 


960 


aaagcactta 


aggaccttta 


ctgccacaaa 


tcagattaat 


ttgggtcgac 




1010 



<210> 21 

<211> 326 

<212> PRT 

<213> Homo sapiens 

<400> 21 

Met Thr Met Asp Ser Gly Ala Asp Asn Gin Gin Ser Gly Asp Ala Ala 
15 10 15 

Val Thr Glu Ala Glu Asn Gin Gin Met Thr Val Gin Ala Gin Pro Gin 
20 25 30 

lie Ala Thr Leu Ala Gin Val Ser Met Pro Ala Ala His Ala Thr Ser 
35 40 45 

Ser Ala Pro Thr Val Thr Leu Val Gin Leu Pro Asn Gly Gin Thr Val 
50 55 60 

Gin Val His Gly Val He Gin Ala Ala Gin Pro Ser Val He Gin Ser 
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65 



70 



75 



80 



Pro Gin Val Gin Thr Val Gin He Ser Thr He Ala Glu Ser Glu Asp 
85 90 95 



Ser Gin Glu Ser Val Asp Ser Val Thr Asp Ser Gin Lys Arg Arg He 
100 105 110 



Leu Ser Arg Arg Pro Ser Phe Arg Lys lie Leu Asn Asp Leu Ser Ser 
115 120 125 



Asp Ala Pro Gly Val Pro Arg He Glu Glu Glu Lys Ser Glu Glu Glu 
130 135 140 



Ala Ser Ala Pro Ala He Thr Ala Val Ala Val Pro Thr Pro He Tyr 
145 150 155 160 



Arg Thr Ser Ser Gly Gin Tyr He Thr He Thr Gin Arg Gly Ala He 
165 170 175 



Gin Leu Ala Ser Asn Gly Thr Asp Gly Val Gin Gly Leu Gin Thr Leu 
180 185 190 



Thr Met Ala Asn Ala Ala Ala Thr Gin Pro Gly Thr Thr He Leu Gin 
195 200 205 



Tyr Ala Gin Thr Thr Asp Gly Gin Gin He Leu Val Pro Ser Asn Gin 
210 215 • 220 



Val Val Val Gin Ala Ala Ser Gly Asp Val Gin Thr Tyr Gin He Arg 
225 230 235 240 



Thr Ala Pro Thr Ser Thr He Ala Pro Gly Val Val Met Ala Ser Ser 
245 250 255 



Pro Ala Leu Pro Thr Gin Pro Ala Glu Glu Ala Ala Arg Lys Arg Glu 
260 265 270 



Val Arg Leu Met Lys Asn Arg Glu Ala Ala Arg Glu Cys Arg Arg Lys 
275 280 285 



Lys Lys Glu Tyr Val Lys Cys Leu Glu Asn Arg Val Ala Val Leu Glu 
290 295 300 
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Asn Gin Asn Lys Thr Leu lie Glu Glu Leu Lys Ala Leu Lys Asp 
305 310 315 



Tyr Cys His Lys Ser Asp 
325 



<210> 22 

<211> 43 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 22 

cccgaattcg ccgccaccat gaccatggac tctggagcag aca 



<210> 23 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 23 

gtcgacccaa attaatctga tttgtggcag 
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